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1 Introduction 

Time and space complexities are the main properties of algorithms. Their counterparts 
in Group Theory are the Dehn and filling length (or space) functions of finitely presented 
groups. In this paper, we study the interrelation of space functions of groups and the 
space complexity of the algorithmic word problem in groups. 

Let G = (A | R) be a group presentation, where A is a set of generators and R is a 
set of defining relators. Recall that relators belong to the free group with basis A, and a 
group word w in generators A (i.e., a word over A ±l ) represents the identity of G iff there 
is a rewriting 

W = Wo —> W\ —)■••• —> w t -i — > w t = 1 (1.1) 

where 1 is the empty word, the sign = is used for the letter-by-letter equality of words, 
and for every i = 1,. . . ,t, the word Wi results from u>j_i after application of one of the 
elementary ^-transformations. As such transformations one can take free reductions of 
sub words aa^ 1 — > 1 (a G A ±x ), removing subwords r , where r G R, and the inverse 
t r ansf ormat ions . 

The minimal non- decreasing function f(n) : N — > N such that for every word w vanish- 
ing in G and having length \ \w\ \ < n, there exists a rewriting (11. ip with t < f(n), is called 
the Dehn function of the presentation G = (A | R) |T3] . For finitely presented groups 
(i.e., both sets A and R are finite) Dehn functions are usually taken up to equivalence 
to get rid of the dependence of a finite presentation for G (see [16J). To introduce this 
equivalence ~, we write / ^ g if there is a positive integer c such that 

f(n) < cg(cn) + cn for any nGN (1-2) 

For example, we say that a function / is polynomial if / ^ g for a polynomial g. From 
now we use the following equivalence for non- decreasing functions / and g on N. 



f~g if both f <g and g ■< f (1.3) 

It is not difficult to see that the Dehn function f(n) of a finitely presented group 
G is recursive (or bounded from above by a recursive function) iff the word problem is 
algorithmically decidable for G (see [11], [6]). In this case, the word problem can be 
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solved by a primitive algorithm that, given a word w of length n, just checks if there 
exists a sequence (II. ip of length < /(n). Therefore the nondeterministic time complexity 
of the word problem in G is bounded from above by f(n). Moreover if if is a finitely 
generated subgroup of G, then one can use the rewriting procedure (11. ip for H, and so 
the nondeterministic time complexity of the word problem for H is also bounded by f{n). 

It turns out that a converse statement is also true. Assume that the word problem can 
be solved in a finitely generated group if by a nondeterministic Turing machine (NTM) 
with time function T(n). Then G is a subgroup of a finitely presented group G with Dehn 
function equivalent to n 2 T(n 2 ) 4 [I]. As the main corollary, one concludes that the word 
problem of a finitely generated group H has time complexity of class NP (i.e., there 
exists a non-deterministic algorithm of polynomial time complexity, that solves the word 
problem for H) iff if is a subgroup of a finitely presented group with polynomial Dehn 
function. 

We want to obtain a similar statements for space functions. It is clear that to perform 
the rewriting (II. ip one needs the space equal to max <j<t \\wi\\, and this observation 
leads to the definition of space function of a finitely presented group. However when 
handling groups, one can enlarge the set of elementary ^-transformations and obtain 
different definitions of space functions. One can either consider only the transformations 
we defined above and obtain the filling length functions introduced by Gromov [TS] (also 
see [12], [3]), or one can also allow to replace words by their cyclic permutations as it was 
suggested by Bridson and Riley [7J, or can add the replacement of a word w = uv by the 
pair (u, v) if both u and v are trivial in G (see [7J again). Starting with this different sets 
of transformations one comes to different space functions called in [7J, respectively, filling 
length function {FL), free filling length functions (FFL), and fragmenting free filling 
length functions (FFFL). Each of these functions has a visual geometric interpretation 
in terms of the transformations of loops in the Cayley complex of G using, respectively, 
null-homotopy, free null-homotopy, and free null-homotopy with bifurcations. It is proved 
in [7J that these functions behave differently for the same finitely presented group G, for 
instance, FFFL can grow linearly while FL and FFL have exponential growth. There 
are many other features of these functions presented in [7J to justify their "inclusion in 
the pantheon of filling invariants" . 

In the paper, we choose the third version (FFFL), and this choice is justified by 
the theorems on the connection of such functions to the space complexity of the word 
problem for groupsJl] Thus we operate with finite sequences of words W = (wi, . . . ,w s ) 
over a group alphabet A. Given a finitely presented group G, we say that a finite sequence 
W = (w[, . . . ,w' s ,) results from W after application of an elementary R- transformation 
if s' E {s — 1, s, s + 1} and one of the following is done for some Wi {i = 1, . . . , s): 

• a subword aa~ l is removed from or inserted to Wi (a G 

• a subword r or r _1 is removed from or inserted to Wi (r £ R); 

• Wi is replaced by a cyclic conjugate; 

• Wi = uv, and Wi is replaced by the pair u, v, i.e. W = (w\, . . . , Wi-i, u, v, u>i+i, . . . , w s ); 

• Wi is removed if it is empty, i.e. W = (wi, . . . , u>j_i, w i+ i, . . . , w s ). 

1 An embedding statement for the FL-functions was conjectured by J.-C. Birget in [3]- 
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Clearly, we have w — 1 in the group G iff there exists an i?-rewriting starting with (w) 
and ending with the empty string ( ). 

For every finite sequence W = (wi, . . . ,w s ) we set \\W\\ = Yli=i \\ w i\\- By definition, 
the space of a rewriting Wo —»•••—>■ Wt is max* =0 ||Wt||. If a word vanishes in G, then 
space(w) = spacec{w) is the minimum of spaces of all rewritings starting with (w) and 
ending with the empty string. The space function of the group presentation G = (A | R) 
(or briefly, of the group G) is the function 

Sc{ n ) — max(space(w), where w = 1 in G and \\w\ \ < n) 

The space functions of finitely presented groups will be regarded up to the equivalence 
defined by (11.21) and (11.31) . and so their growth will be at least linear. It is observed in [7J 
that the equivalence class of Sg does not depend on a finite presentation of the group G, 
moreover this class is invariant under quasi-isometries. 

An accurate definition of the space function f(n) for a Turing machine (TM) will be 
recalled in Subsection 12.11 Now we just note that it is usual that for a multi-tape TM, 
the function f(n) counts only the space of work tapes. However since the space functions 
of machines are taken here up to the same equivalence as the space functions of groups, 
the adding of the space of the input tape does not change the equivalence class. 

The sequence W —)■•••—>■ Wt can be easily produced by an NTM such that the 
computation needs at most 2max' =1 ||Wj|| + const tape squares. (See also Section 3 of 
[25] or Remark 2.4 in [7].) This immediately implies 

Proposition 1.1. The space function of a finitely presented group G is equivalent to 
the space function of a non- deterministic two-tape TM. The language accepted by this 
machine coincides with the set of words equal to 1 in the group. 

In particular, the non-deterministic space complexity of the word problem in a finitely 
presented group G does not exceed the space function of G. It follows from [16], [8] 
that the literally converse statement fails. Moreover, a counter-example can be given by 
Baumslag's 1-relator group [lj G = (a,b \ (a6a _1 )6(a6a _1 ) _1 = b 2 ) because the space 
function of G is not bounded from above by any multi-exponential function (see papers of 
S.Gersten [10] and A.Platonov [23J) while the space complexity of the word problem for 
G is at most exponential as this was proved by M. Kapovich and Schupp (unpublished), 
moreover, it is polynomial (announced by A. G. Miasnikov, A. Ushakov, and Dong Wook 
Won). The correct formulation has to take into consideration that the algorithm from 
Proposition 11.11 solves the word problem not only for G but also for every finitely generated 
subgroup of the group G. In the deterministic case we get a sharper formulation: 

Theorem 1.2. Let H be a finitely generated group such that the word problem for H is 
decidable by a deterministic TM (DTM) with space function f{n). Then H is a subgroup 
of a finitely presented group G with space function equivalent to f{n). 

The main corollary of this theorem applies to polynomial space complexity. We say that 
a finitely generated group G belongs to the class PSPACE (to NPSPACE) if the word 
problem for G is decidable by some DTM (some NTM) with a polynomial space function. 
But NPSPACE = PSPACE by remarkable Savitch's theorem (see [9], Corollary 1.31) 
which contrasts with deterministic time complexity having therefore no natural algebraic 
counterpart. Proposition 11.11 and Theorem 11.21 eliminate any non-determinism in 
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Corollary 1.3. A finitely generated group H belongs to PSPACE iff H is a subgroup 
of a finitely presented group G having polynomial space function. 

Thus, given a 'good' algorithm solving the word problem in H (e.g., using a matrix 
representation of H, etc.), it is possible to find a bigger group G whose deterministically 
modified ('silly') natural algorithm solves the word problem for both H and G, and whose 
space funtion is not much worse than the space function of the original algorithm. 

Another natural question raised in our paper is the realization problem: Which func- 
tions f{n): N — > N are, up to equivalence, the space functions of finitely presented groups? 
There are not many examples; linear and exponential ones can be found in [7J, but it is 
not easy even to point out a group with space function n 2 . 

Theorem 1.4. Every space function f(n) of a DTM is equivalent to a space function of 
some finitely presented group. 

This theorem gives a tremendous class of space functions for groups, including functions 
equivalent to [exp-y/n], [n k ] (k G N), [n k \og l n], [n k log' (log log n) m ], etc. 

The main theorem implies a non-deterministic corollary. To formulate it we recall that 
a function s : N — > N is called fully space- constructible (FSC) if there exists a two-tape 
DTM that on any input x of length n halts visiting exactly s(n) tape squares of the work 
tape. Most common functions are FSC (see [9]). 

Corollary 1.5. Let H be a finitely generated group such that the word problem for H is 
decidable by an NTM having an FSC space function f{n). Then H is a subgroup of a 
finitely presented group G with space function equivalent to f(n) 2 . 

Finally, we describe the functions n a which are (up to equivalence) the space functions 
of groups. For this aid, we modify the proof of Savitch's theorem from [9] and the approach 
from |25j, where the similar problem was considered for Dehn functions if a > 4, and close 
necessary and sufficient conditions were obtained. (See also a dense series of examples 
with a > 2 presented by Brady and Bridson [5] . ) Now we have a > 1 in Corollary 11.61 
below. Also it is remarkable that for space functions the necessary and sufficient conditions 
just coincide. To formulate the criterion, we call a real number a computable with space 
< f(m), if there exists a DTM which, given a natural number m, computes a binary 
rational approximation of a with an error 0(2~ m ), and the space of this computation 
</(m). 

We have got the following criterion. 

Corollary 1.6. For a real number a > 1, the function [n a ] is equivalent to the space 
function of a finitely presented group iff a is computable with space < 2 2 "\ 

It follows that functions [n n ], [n^], and [n a ] with any algebraic a > 1 are all the space 
functions of finitely presented groups. 

The space function is defined for a simply connected geodesic metric space under 
some weak restrictions, in particular, for the universal cover of any closed connected 
Riemannian manifold. (See [7J for details; we just note here that to define the space (= 
FFFL) function, one should consider free homotopy with possibility of separating of a 
loop in two loops in a bifurcation point.) It is proved in [7J (Theorem E) that if a finitely 
presented group G acts properly and cocompactly by isometries on such a space X, then 
the space function of G is equivalent to the space function of X. Since every finitely 
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presented group is a fundamental group of a connected compact Riemannian manifold, 
we can use corollaries 11.41 and 11.61 and formulate one more 

Corollary 1.7. For every space function f(n) of a DTM, there exists a closed connected 
and simply connected Riemannian manifold M (with a properly cocompact action of a 
finitely presented group on it) such that the space function of M is equivalent to f(n). 

In particular, if a real number a > 1 is computable with space < 2 2 " 1 , then there exists 
such a manifold with space function equivalent to n a . 

To some extent, our constructions can be traced back to the works of P.Novikov, Boon, 
Britton and other authors who invented group-theoretical interpretation of TM (see [23] , 
ch. 12). The hub relation copies the accept configuration of a machine several times. 
Using the language of van Kampen diagram, we correspond to every computation, a disc 
surrounding the hub cell, and this disc has a number of similar sectors. First of all in 
the present paper, we are to estimate the sizes of computational discs, and to do this one 
should know the generalized space function of a machine which estimates the space of 
computations starting with arbitrary accept configuration, not only with input ones. So 
we should modify the initial machine to be able to control the generalized space function. 
(The modification from [25] helps to control the time function but corrupts the space 
function.) 

The next modification is due to the symmetry of algebraic relations: since u = v always 
implies v = u, the algebraic version of a machine M always interprets the symmetrization 
of M. Thus we concerns that the symmetrization preserves the basic characteristics, e.g. 
the accepted language and the space functions. We are able to do this only if the initial 
machine is deterministic or can be transformed to a deterministic under the control of 
basic properties. (The known symmetrization trick from [2] or |25j does not work here 
since it does not preserve the space function.) This causes the restrictions in formulations 
of Theorem 11.21 and Corollary 11.51 

The interpretation problem for groups remains much harder than for semigroups even 
after the adaptation of the machine because the group theoretic simulation can execute 
unforeseen computations with non-positive words. Boon and Novikov secured the posi- 
tiveness of admissible configurations in discs with help of an additional 'quadratic letter' 
(see [24], ch.12), but this involves a difficult control of parameters for the constructed 
group. A new approach was suggested in [25] . Invented by Sapir S'-machines can work 
with non-positive words on the tapes. Here we also construct an S'-machine which is a 
somewhat modified composition of a convenient Turing machine with an 'adding machine' 
Z(A) introduced in [22] . Fortunately, Z(A) does not change the space of computations 
but controls positiveness of configurations. 

Recall that we should not just simulate the work of a machine but construct an em- 
bedding of given group H into a finitely presented group G in the spirit of the Higman 
Embedding Theorem, (see [23], ch.13). For this aid, we use a version of the two-disc 
scheme presented in the survey [TH] and first applied in [21j . Simplifying, one can say 
that the configurations on the boundary of discs of the first type are longer than the 
words written on the boundary of corresponding discs of the second type, and the sur- 
pluses are relations of the group H. Since every relation of H holds in G, we obtain a 
homomorphism H — >■ G that turns out injective. 

To estimate the space function of the group G we use van Kampen diagrams and 
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induct on the number of hubs. The basic trouble is to cut a diagram A having at least 
two hubs into two subdiagrams with hubs, so that the perimeters of the subdiagrams 
do not exceed the perimeter of A. We have to introduce a new metric where the length 
of a word depends on syllable factorization of it. To find a short cut we use the mirror 
symmetry of sectors in discs, but unfortunately for the two-disc scheme, one of the sectors 
has no (mirror) copies, which creates technical obstacles!! We study both exact and non- 
accurate copying for various types of complete and incomplete sectors. A number of 
concepts, e.g. bands, trapezia, discs were incorporated in the algorithmic group theory 
early, in papers [25J, [T5], [I] and subsequent ones, and we reproduce them in Section [3] 
(and partly in sections [2] and H|) in the form they are used now. Some others (replica, 
unfinished diagram, simple disc) are new. 

We mean to consider similar problems for semigroups where the simulating of machines 
is easier, and for the definition of space function, one does not need cyclic shifts and 
fragmentation of words. 

2 Machines 
2.1 Definitions 

We will use a model of TM which is close to that in [22]. Recall that a (multi-tape) TM 
has k tapes and k heads. One can view it as a tuple 

M=(X,Y,Q,Q,s u s ) 

where X is the input alphabet, Y = U^ =1 Fj is the tape alphabet, Y 1 D X, Q = uf[ =1 Qi 
is the set of states of the heads of the machine, G is a set of transitions (commands), s*i 
is the fc-vector of start states, s is the fc-vector of accept states. (U denotes the disjoint 
union.) The sets X, Y, Q, 9 are finite. 

We assume that in the normal situation the machine starts working with states of the 
heads forming the vector s±, with the head placed at the right end of each tape, and 
accepts if it reaches the state vector s*o. In general, the machine can be turned on in any 
configuration and turned off at any time. 

A configuration of a tape number i of a TM is a word uqv where q G Qi is the current 
state of the head, u is the word to the left of the head, and v is the word to the right of 
the head. A tape is empty if u, v are empty words. 

A configuration U of a TM is a word 

a 1 U 1 u 1 a 2 U 2 uJ2---akUk^k 

where Ui is the configuration of tape i, and a i: uJi are special separating symbols. For 
unification of notation, we shall treat at^u^ as heads of the machine too. These heads 
correspond to tapes that are always empty and do not change during a computation. 

An input configuration is a configuration where all tapes except the first one are empty, 
the configuration of the first tape (let us call it the input tape) is of the form uq, q G Qi, 
u is a word in the alphabet X, and the states form the start vector s±. The accept 
configuration is the configuration where the state vector is s*o, the accept vector of the 

2 Note that one can give shorter proofs of Theorem 11.41 and Corollary 11,61 which do not need the two-disc scheme; but 
here we just obtain these results after Theorem 1 1.2 1 is proved. 
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machine, and all tapes are empty. (The requirement that the tapes must be empty is 
often removed for auxiliary machines which are used as parts in constructions of bigger 
machines.) 

To every G 9 we correspond a command (marked by the same letter 9), i.e., a pair 
of sequences of words [Vi, Vk] and [V/, V[\ such that for each j < k, either both Vj 
and Vj are configurations of the tape number j or Vj = a^q and V- = atjq' (q, q' G Qj), or 
Vj = qujj and V- = q'ujj (q,q' G Qj ). 

In order to execute this command, the machine checks if Vi is a subword of the config- 
uration of tape % for each % < k and if this condition holds the machine replaces Vi by V- 
for all % = 1, . . . , k. 

Let we have a sequence of configurations Wo,...,w t and a word h = 9\ . . . 9 t in the 
alphabet 6, such that for every i — 1, t the machine passes from Wi-i to Wi by applying 
the command 6{. Then the sequence (wq — >■ W\ —>••-• — >■ w t ) is said to be a computation 
with history h. In this case we shall write wq ° h = Wt- The number t will be called the 
time or length of the computation. 

A configuration w is called accepted by a machine M if there exists at least one com- 
putation which starts with w and ends with the accept configuration. We do not only 
consider deterministic TM, for example, we allow several transitions with the same left 
side. Moreover, for non-deterministic TM, we allow in this paper, to correspond identi- 
cally equal executions to different symbols 9,9' G 0. 

A word u in the input alphabet X is said to be accepted by the machine if the corre- 
sponding input configuration is accepted. The set of all accepted words over the alphabet 
X is called the language Cm recognized by the machine M. 

Let \wi\ a (i = 0, ...,t) be the number of tape letters (or tape squares) in the configura- 
tion Wi. (As in [23], the tape letters are called a-letters.) Then the maximum of all \wi\ a 
will be called the space of computation C : w — » w\ —)■•••—)■ w t and will be denoted by 
spaceMiC). By spaceMiw), we denote the minimal natural number s such that there is 
an accepted computation of space at most s, starting with the configuration w. If u G L M , 
then, by definition, spaceM{u) is the space of the corresponding input configuration w. 

The number S(n) = S'jvf(n) is the minimum of the numbers space(u) over all words 
u G Cm, with ||w|| < n. The function S(n) will be called the space function of the Turing 
machine. 

The space of a computation and the space function of M are defined similarly, but one 
does not count the a-letters on the input tape. 

The definitions of the generalized space function S'(n) = S' M (n) is also similar to the 
definition of space function but we consider arbitrary accepted configurations w with 
\w\ a = n, not just the input configurations as in the definitions of S(n). It is clear that 
S(n) < S'{n). 

To obtain the definitions of timeM{w), timeM{u), time function Tm{ti) and generalized 
time function T' M {n), one should replace 'space' for 'time' in the previous definitions. 

Given an NTM M, one can add additional states and two more commands so that only 
input configurations involve the state letters from S\ and only one command applicable 
to the input configurations, and there is a unique accept configuration s with a unique 
accepting command. In this case we will say that the machine satisfies the s\q- condition. 
This assumption changes neither the language C no the functions Sm(ji) and S' M (n). 
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2.2 Machines with equivalent space and generalized space functions 

In this subsection, we construct an NTM M 2 which depends on an NTM Mi, and prove 
Lemma 12. 11 

Let an NTM Mi have k tapes, and the first tape of it be the input tape. Then we 
add a tape number k + 1, which is empty for input configurations, and organize the work 
of the 3-stage machine M 2 as a sequential work of the following machines M 2 i, M 22 , and 
M 23 . 

The machine M 21 uses only one command that does not change states and adds one 
square with an auxiliary letter * to the (k + l)-th tape. M 21 can execute this command 
arbitrary many times while the tapes number 1, . . . , k leave unchanged the copy of an 
input configuration of Mi. Then a connecting rule 9i 2 changes all states of the heads and 
switches on the machine M 22 . 

The work of M 22 on the tapes with numbers 1, . . . , k copies the work of Mi, but we 
extend every command 9 of Mi to the (k + l)-th tape as 9' so that an application of 9' 
does not change the current space, that is, if the application of 9 inserts mi tape squares 
and deletes m 2 tape squares, then 9' inserts m 2 -m! (deletes m 1 — m 2 ) squares with letter 
* on the (k + l)-th tape if mi — m 2 < ( if mi — m 2 > ). However one cannot apply 9' 
if mi — m 2 exceeds the current number of squares on the tape number k+1. 

The connecting command 9 23 is applicable when M 22 obtains the accept configuration 
on the first k tapes. It changes the states and switches on the machine M23 erasing one 
by one all squares on the {k + l)-th tape, and then M 2 accepts. (The tape alphabet of 
M 2 3 has only one letter *.) 

Let w be a configuration of the machine M 2 such that woj, is defined, or w be obtained 
after an application of the connecting command 9i 2 . Then we have an input configuration 
on the tapes with numbers 1, . . . , k (plus several *-s on the (k + l)-th tape). We will 
denote by u(w) the input word u written on the first tape. It is an input word for the 
machine Mi as well, and if it is accepted by Mi, the expression spaceMiU^w) makes sense. 

The connecting commands 9i 2 and 9 23 are not invertible in M 2 by definition. Therefore 
every non-empty accepting computation of M 2 has history of the form hih 2 hs, or h 2 h^, 
or hs, where hi is the history for M 2 i, (I = 1, 2, 3). (To simplify notation we can attribute 
the command 9i 2 (the command #23) to h 2 (to ^3).) 

Lemma 2.1. (a) The machines Mi and M 2 recognize the same language C. (b) The space 
function SM 2 { n ) an d the generalized space function S' M2 (n) of M 2 are both equivalent to 
Sm^k)- (c) If w is an accepted configuration M 2 , and the command 9* is applicable to w, 
then spaceM 2 ( w ) — max(spaceMi(^(w)), \w\ a ). 

Proof. Assume that u G £ = £m x - Then u G £' = £m 2 because machine M 2i can 
insert sufficiently many squares (equal to the spaceM 1 { u ) ~ I Ml) so that the accepting 
computation of Mi can be simulated by M 22 . Also it is clear from the definition of M 2 , 
that every accepting computation for M 2 having a history hih 2 h 3 as above, simulates, 
at stage 2, an accepting computation of Mi with history h 2 . Therefore £' = £ and 
SM^n) = S M2 (n). 

Assume now that C : w = Wo — > ■ ■ ■ — > w n is an accepting computation of M 2 with 
spaceM 2 {C) = spaceM 2 {w) and h = hih 2 h 3 is the history with the above factorization (hi 
or hih 2 can be empty here). If the word hi is empty, then ||tfo|| > ■ ■ • > H w n|| by the 
definition of the machines M 22 and M 32 . Hence the space of this computation is equal to 
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I w | a El- Then let hi be non-empty. It follows that the machine M 2 starts working with a 
copy of an input configuration of the machine M±, i.e., the input tape of this configuration 
contains an input word u = u(w), and the additional (k + l)-th tape has m squares for 
some m > 0. Moreover u G C since the computation of M 2 is accepting. We consider two 
cases. 

Case 1. Suppose m > spaceM 1 {u) ~ I Ml- This inequality says that the additional 
tape has enough squares to enable M 22 simulating the accepting computation of Mi with 
the input word u. Hence there is an M 2 -computation wq —>■••• — >■ w n > with history of the 
form h' 2 h' 3 , and so its space, as well as the space of our original accepting computation is 

Ma- 
Case 2. Suppose m < space m 1 (u) — | \u\ |. Then there is a computation wo — > ■ ■ ■ — > w n i 
such that the commands of the M 2 i-stage of it insert squares until the total number of 
the squares of the {k + l)-th tape becomes equal to space m^u) — |H|, and then the 
machines M 2 i and M 2 3 work in their standard manner. The space of this (and the 
original) computation is space m^u). 

The estimates obtained in cases 1 and 2 prove the statement (c) of the lemma. They 
also show that 

SmM) = SM 2 {n) < S' M2 (n) < max(S Ml {n),n)) ~ S Ml (n) 
and the statement (b) is completely proved too. □ 

2.3 Symmetric machines 

For every command 9 of a TM, given by the vector [Vi — >■ V{, . . . , — > V£\, the vector 
\V{ — Y Vi, . . . , VI — > \4] gives also a command of some TM. These two commands 9 and 
9~ 1 are called mutually inverse. 

From now we will assume that the machine Mi we started in Subsection 12.21 is a DTM 
and satisfies the Si -condition. 

Since the machine Mi is deterministic, we have no invertible commands of the machine 
M 2 . The definition of the symmetric machine M 3 = M 2 ym is the following. Suppose M 2 = 
(X, Y, Q, 9, si, s ). Then by definition, Mf m = (X, Y, Q, 9^ m , si, s >, where Q sym is the 
minimal symmetric set containing 0, that is, with every command [Vi — > V{, . . . , 14 + i — >■ 
Vl +1 ] it contains the inverse command [V{ — > Vi,...,V^' +1 — > Vjt+i]; in other words, 

Qsym = 0+ U Q- where 0+ = © an d 0- = {0-1 | e 0}. 

A computation wq —)■•••—)■ wt of M3 (or of other machine) is called reducible if its 
history is a reduced word. If the history h = 9i . . . 9 t contains a subword 6^+1, where the 
commands 0j and 9 i+ i are mutually inverse, then obviously there is a shorter computation 
w —>■••• — >■ Wi-x = w i+ i — > ■ ■ ■ -> w ( whose space does not exceed the space of the 
original computation. 

Lemma 2.2. Let w = w — > wi — > ■ ■ ■ — > w t be an accepted reduced computation of the 
machine M 3 , and the command be applicable to w. Then 

(a) the word u(w) belongs to the language £ recognized by Mi and 

(b) the space of this computation is at least space m 1 (u(w)). 

3 Here and further we keep in mind that the difference — |ui^| a is a constant for any computation. 



9 



Proof. Let h = 9(1) . . . 9(t) be the history of the computation. If for some i, tUj + i = 
Wi o 9(i + 1) where 9(i + 1) = # 23 or 9(i + l) 1 *" 1 is a command of M 23 , then one can modify 
our accepted computation so that, for j > % + 1, every command 9(f) is a command of 
M23 and < Hence we may assume that h has exactly one letter 623 followed 

by the commands of M 23 only, and h = h TihiT 2 . . . r s h s , where r s = 623, Tj = 9 12 

for i < s, and the subwords /ij-s contain no connecting commands. We may also assume 

that the subword ho is empty since the command 0* does not change the sub word u(w). 

Since, by the sio-condition, only one command of the machine M\ (and of its analog 
M22) accepts, the last command of h s ^\ is this unique command of M22, and so this last 
command is positive. Therefore if /i s _i contains a letter 9" 1 , where 9 is a command of 
M 2 2, then h has a 2-letter subword 9^92, where both 9\ and 9 2 are commands of M 2 2- 
Hence there is a configuration Wi such that both 9\ and #2 axe applicable to Wi. This 
is impossible since the machine Mi is deterministic and the history h s _i is a reduced 
word. Therefore h s _i is entirely the history of a computation of M 2 2, the computation 
Wi s _ 1 —»■•••—»■ Wt with history r s ^ih s ^iT s h s = 9i 2 h s ^i923h s is an accepted computation 
of M 2 , the word u(wj s _J belongs to the language £, and the space of the computation 
Wi s _ 1 —>■•■■ w 4 is at least spaceMiW^wJ) by Lemma [2TT1 (c). 

Assume, by induction on j, that ) belongs to £ for j > 1, where the computation 

w i s -j ~ * ' ' ' ~ * w t nas history T s _jh s _j . . .r s h s , and the space of this computation is at 
least space Ml (u{w is _.)). 

Then the word iy» s _ ._ 1 has similar properties if h s _j^i consists of the commands of M 2 i 
or their inverses since these commands do not change the content of the tapes number 
l,...,k. Otherwise the commands of are commands of M22 (and inverses), and 

since this machine is deterministic, the word h s -j-i has no subwords 9± l 92 with positive 
9\ and 9 2 . Therefore we have h s -j-i = g'g"^ 1 , where both g' and g" are (positive) histories 
of M 2 2-computations. This implies the equality ._ a o r s _j_i) o g' = Wi s ^ j o g" . Since 
the commands 9f 2 do not change u(Wi)-s, we have W is _ j _ 1 og' — W is _ j op", where W is _ j _ 1 
and Wi a _. are the input configurations for the machine M\ with inputs u(wi s _ :j _ l ) and 
w(u'i s _ j ), respectively. (Here we use identical letters for the corresponding commands of 
Mi and M 22 .) 

The machine Mi is deterministic, and so the accepted computation for Wi s _ j must look 
like Wi s _. Wi s _ 3 o g" — y . . . , and consequently, the configuration W ia _ j o g" is 

accepted by Mi. Therefore we can construct the accepted computation Wi s _ j _ 1 —>■•••—)■ 
W ia _ j _ 1 og' = Wi s _. og" — y . . . for M 1; and so the word u(w is belongs to £, as desired. 

The constructed accepted computation of Mi is decomposed in two parts. It follows 
from the definition of M 2 2 that the space of the first part is majorized by the space of the 
M 3 -computation Wi s .j —>••••—>■ w ia _ j _ 1 og' which is a part of the computation Wi s _ j _ 1 — )■ 
■ ■ ■ — > Wt- The second part is a part of the deterministic accepted Mi-computation with 
input u(wi s -), and so, by the inductive hypothesis, the space of this part does not exceed 
the space of the M 3 -computation Wi - —>••••—)■ Wt- Hence spaceM 1 (u(uii a _ J _ 1 )) does not 
exceed the space of the M 3 -computation w ia _ j _ 1 —)■•••—)■ w t . 

Since w = , the lemma is proved by induction on j. □ 

Lemma 2.3. The machines M\ and M 3 recognize the same language. The generalized 
space functions S' M2 (n) and S' Ma (n) are equivalent. 

Proof. We recall that every computation of the machine M 2 is also a computation of M 3 . 
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Therefore the first statement follows from lemmas 12.11 (a) and 12.21 (a). 

To prove the second part, it suffices to prove that for every accepted configuration w 
of M 2 (of M 3 ), there is an accepted configuration w' of M 3 (of M 2 ) such that \ \w'\ \ < \ \w\\ 
but spaceM 3 (w') > spaceM 2 (w) (respectively, spaceM 2 (w') > spaceM 3 (w)). 

(1) Consider an accepted computation w = wo — > ■ ■ ■ — > w t of M 2 whose space is equal 
to spaceM 2 ( w )- If th e first command of this computation is not a command of M21, then 
H^oll > I l^i 1 1 > • • " > \\ w t\\, and therefore spaceM 2 (w) — Ma < spaceM 3 (w), and so one 
can choose w' = w. If the first command is a command of M 21 , then by lemmas 12.1( c) 
and 12.21 (b), we have spaceM 2 {w) = max(spaceMi (u(w)), \w\ a ) < spaceM 3 (w), and again 
w' = w. 

(2) Now consider a reduced accepted computation w = wq — > ■ ■ ■ — > w t of M 3 whose 
space is equal to spaceM 3 {w). If the first command (or the inverse of it) is a command 
of M23, then the commands of the shortest accepted computation with minimal space 
just erase squares. Hence spaceM 3 {w) = \w\ a = spaceM 2 ( , w), and we can choose w' 
equal to w. If the first command is a command of M^f™, then by Lemma 12.21 (a), the 
word w is accepted by M 2 . Since every command of M 2 is a command of M 3 , we have 
space m 3 (u!) < spaceM 2 (w), and again it suffices to set w' = w. 

Thus, we may assume that the first command of our computation (or the inverse) is a 
command of M 22 . Therefore the history h of this computation has a prefix h'h" with non- 
empty h" , where every command of h! is command of M 2 | m and either every command of 
h" (or the inverse command) is a command of M 2 i or every command of h" is a command of 
M 23 . In the latter case, we may assume that h = h'h" and ||wo|| > ll w i|| > • • • > ll^tll; an d 
so spaceM 3 (w) = \w\ a < spaceM 2 {w), and w' = w. In the former case, we set w' = w o hi 
and note that ||w|| = ||u>i|| = ••• = \\w'\\ since the commands of the computation 
w —>•••—)■ w' with history h! do not change the number of tape squares. In particular, 
we have spaceM 3 (w) < space m 3 (w'). Since the command 9* is applicable to w' in this 
case, we have spaceM 3 (w') < spaceM 2 (w') as this was observed in the previous paragraph. 
Therefore space m 3 (w) < spaceM 3 (w') < spaceM 2 {w') , as desired; and the lemma is proved. 

□ 

Lemma 2.4. For every DTM M recognizing a language C and having a space function 
S(n) (for every NTM M recognizing a language C and having an FSC space function 
f(n) ), there exists an NTM M' with the following properties. 

1. The machine M' recognizes the language L. 

2. M' is symmetric. 

3. The space and the generalized space functions of M' are equivalent to S(n) (respec- 
tively, are equivalent to f(n) 2 ). 

4- For every command [Vj —> V{, . . . , Vj, — > V£\ of M' , we have \Vi\a + Yl l^'U — 1; 
i.e., at most one tape letter is involved in the command. 

Proof. Assume that the machine M is deterministic. Then starting with Mi = M, we 
construct machine M 2 described in Subsection 12.21 and machine M 3 = M 2 ym '. Now Lemma 
12.31 implies statement 1, and statement 2 is true since M 3 = M^ vm . Then, by lemmas 12.31 
and 12.11 (b), we have 

S'mM) ~ S M 2 (n) ~ S M2 (n) ~ S Ml (n) = S{n) 
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, and S(n) < SM 3 {n) < S' M , A (n) by Lemma l2"72l (b). and so all these functions are equivalent. 
Finally, we modify M 3 to obtain property 4. For example, if for a one-tape machine we 
have a command aq — > bq', then we introduce a new state letter q" and replace this 
command by two commands aq — > q" and q" — > bq'. It is easy to see that the obtained 
machine M' satisfies property 4 and keeps holding properties 1 — 3 of M3. 

If M is non-deterministic, then we first use that the function f(n) is FSC, and there- 
fore, by Savitch's theorem ([9], Theorem 1.30), there exists a DTM M\ accepting the 
same language L with space function equivalent to f{n) 2 . So the replacement of S{n) by 
f(n) 2 in the previous paragraph provides the proof of the non-deterministic version of the 
lemma. □ 



2.4 S-machines 

Ordinary Turing machines work with positive words and they can see letters on the tape 
near the position where the head is. The command executed by the machine depends not 
only on the state of the head but also on the letter(s) observed by the head. In contrast, 
S-machines introduced in [SBR] work with words in group alphabets and they are almost 
"blind" , i.e., the heads do not observe the tape letters. But the heads can "see" each other 
if there are no tape letters between them. We will use the following precise definition of 
an S-machine S. 

Let A; be a natural number. Consider a language of admissible words. It consists of 
words of the form 

giMig 2 • • -u k q k+ i, 

where qi are letters from disjoint sets Q, t [i = 1, . . . , k + 1), u,- t are reduced words in a 
group alphabet Y it (i.e. every letter a belongs to it together with the inverse letter a -1 ) 
and the sets Y = UYj and Q = L\Qi are finite. The letters from Q are called state letters, 
the letters from Y are tape letters. Notice that in every admissible word, there is exactly 
one representative of each Qi and these representatives appear in this word in the order of 
the indexes of Qi. (i.e., unlike [22], we consider only the regular order of Qi-s in admissible 
words) . 

There is a finite set of commands (or rules) 0. To every 9 G 0, we associate two 
sequences of reduced words from the free group F{Q UY): B{9) = [U%, U k +i], T{6) = 
[V x , Vfc+i], and a subset Y{9) = UY0) of Y, where Yi(6) C Y t . 

The words Ui, Vi satisfy the following restriction: 

(*) For every i — 1, k + 1, the words Ui and Vi have the form 

Ui = v^qiUi, Vi = v' i _ 1 q[u\ 

where q iy q\ G Qi, Ui and u\ are words in the alphabet Yi(9), and v\_ x are words 
in the alphabet F i _i(6 l ). The words v , v' Q , u k+ i, u' k+l are empty. 

Sometimes we will denote the rule 9 by [U\ — > V\, U k +i —> V k+ \]. This notation 
contains no information about the sets Yi{9). In most cases it will be clear what these 
sets are. In the ^-machines used in this paper, the sets Yi(9) will be mostly equal to either 
Y or 0. By default Yi(9) — Fj. 

(■ 

In order to simplify the notation, we will use the notation ViqiUi — > v^Ui for a part of 
a rule when the corresponding Yi(9) is empty (a similar notation has been used in |20j). 
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Every S-rule 9 = [Ui —> V\, Uk+i — > Vk+i] has an inverse _1 = [Vi — > U\, I4 + i — > 
Uk+i]] we set Y^(0 -1 ) = Yi{6). We always divide the set of rules of an S-machine into 
two disjoint parts, G + and G~ such that for every 9 G @ + , 9~ 1 G 0~ and for every 
9 G 0~, 9~ 1 G 6 + . The rules from + (resp. 0~) are called positive (resp. negative). 

An S'-machine is a rewriting system. To apply an S'-rule 9 to an admissible word 
W = qiW\q2 ■ ■ .Wkqk+i means to check if every Wi is a word in the alphabet Yi(9) and 
then, if W satisfies this condition, to replace simultaneously subwords Ui by subwords Vi 
(i = 1, . . . , k + 1). This replacement is allowed to perform in the form qi — > Vi_ 1 v~ 1 q' i u^ 1 Ui 
followed by the reducing of the resulted word. The following convention is important 
in the definition of S-machine: After every application of a rewriting rule, the word is 
automatically reduced. The reducing is not considered a separate step of an S-machine. 

The definitions of computation, its history, input admissible words, the accept word, 
the language of admissible words, space of a computation, space and generalized space 
functions, time and generalized time functions of an S-machine are similar to those for 
TM. (One should replace the word "configuration" by "admissible word" in the defini- 
tions.) 

Although S-machines are usually highly non-deterministic, they better adapted to 
simulating by finitely presented groups than ordinary TM (and moreover, S-machines 
are treated in [22] as HNN-extensions of free group with basis YUQ). On the other hand, 
it is mentioned in [25] that every symmetric NTM M can be viewed as an S-machine 
S(M): just interpret the commands of the Turing machine as S-rules. (For example, the 
part of a rule of the form aq —> bq' is interpreted as q — > a~ 1 bq', and the part of the 

form atjq — > a^q' is interpreted by the pair ctj -4 aj, q — > q'.) Unfortunately, the language 
recognized by S(M) is in general much bigger than the language recognized by M since 
M works with a positive tape alphabet only. Nevertheless the following statement is true: 

Lemma 2.5. (Compare with Prop. J^.lWh^.) Every computation of a symmetric NTM 
M is a computation of S(M) with the same history. If M satisfies property 4 from Lemma 
2.4\ then every positive computation of S(M), i.e., a computation consisting of positive 



words, is a computation of M with the same history. 

Proof. Every positive admissible word W of S(M) is a configuration of the Turing machine 
M. Assume that a rule 9 of S(M) corresponding to a command 9 of M is applicable to 
this W and the word W o9 is positive. Recall that by property 4 of Lemma [231 9 involves 
at most one tape letter (e.g., it cannot replace a tape letter by a tape letter or have a 
part of the form aq — > aq'). Therefore the positiveness of both W and W o 9 implies that 
the application of 9 just coincides with the application of 9. The statement of the lemma 
follows. □ 



2.5 Composition with an adding machine 

Further we use the auxiliary adding S-machine Z(A) from [22]- In [22], the main duty of 
Z(A) was the exponential slowing down of basic computations, while now we will mainly 
use the capacity of Z(A) (observed in Lemma 3.25 (2) |22j) to check whether an admissible 
word is positive or not. 

The tape alphabet of Z(A) consists of an alphabet A ±x and two copies Aq 1 and Af l of 
A ±l while the input alphabet is Aq 1 . The admissible words are of the form LupvR, where 
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u is a reduced word in the alphabet Aq 1 U Af 1 , v is a reduced word in A ±:L , the symbols 
L,p, R, are state letters, the commands do not change L and R, and p G {p(l),p(2),p(3)}. 
The input configurations have form Lup(l)R and the accept ones are of the form Lup(3)R. 
The list of rules is given in subsection 3.6 of [22], but we will not use them here and rather 
formulate, in Lemma 12. 6 [ the required properties obtained in [22]. 

If w a word in the alphabet A^ 1 U Af l , then its projection onto A ±l takes every letter 
to its copy in A. 

Lemma 2.6. The following properties of the machine Z(A) hold. 

(1) Every positive input word u in the alphabet Aq is accepted by a canonical com- 
putation of Z(A) with a positive history and equal lengths of all words appearing in this 
computation. 

(2) For every computation LupvR — w —>•••—)■ w' = Lu'p'v'R ofZ(A), the projections 
of the words uv and u'v' onto A are freely equal. In particular, u — v! if the words v and 
v' are empty and the words u and u' contain no letters from Af 1 . 

(3) If Wo — > ••• — > w t is a reduced computation of Z(A) and \\w \\ < ||u>i||, then 

I \ w l\\ < I \ w 2\ \ <•••<! \Wt\\- 

(4) For every reduced computation wo — > ■ ■ ■ — > w t , we have \\wi\\ < max(| |u> | |, \\u>t\\) 
(i = 0,...,t). 

(5) If w = LupR, where p = p(l) (or p = p(3) ), w = wo — > ■ ■ ■ — > w t is a reduced 
computation, w t contains the subword p(3)R (respectively, p{l)R), and all a-letters of wq 
and w t are from Aq 1 , then u is a positive word and all words of the computation have the 
same length. The length of this computation is at least 2" n ". 

(6) There no reduced computation w = Wq — > ■ ■ ■ — > w t of length t > 1 such that both 
w and w t contain p(l)R or both of them contain p(3)R and all a-letters of w and w t 
belong to A^ 1 . 

Proof. (1) This computation is described in [22], p. 1344. 

(2) This claim is the statement of Lemma 3.18 of [22J. 

(3) This statement is true by Lemma 3.24 of [22J 

(4) , (5) These statements are contained in Lemma 3.25 of [22]. 
(6) This statement is contained in Lemma 3.27 of [22J. 

□ 

We somewhat modify the definition of the composition of a symmetric Turing machine 
M and the adding machine Z(A), given in [22]. The difference is that machines of the 
form Z(A) will work not only after the application of every command of M but before 
applications of commands from M as well. This make possible to simulate the work of any 
symmetric NTM, not only S'-machine as that was done in [22] . So the aim of following 
interbreeding is to obtain an S-machine S which recognize the same language and has 
the same space and generalized space function as the symmetric Turing machine M. The 
S'-machine constructed in [SBR] cannot serve in the present paper since the space and 
the generalized space functions of that machine are equivalent to the time function. 

Consider a symmetric NTM M = (X, Y, Q, 9, s*i, s*o) with Y = U l i=1 Yi, and with the s*io- 
condition. The set is a disjoint union of positive and negative commands: = G + U0~. 
Let S(M) be the associated S'-machine defined before Lemma 12.51 We will assume that 
the admissible words of S(M) are of the form k\Uik 2 . . . Uiki + \, where hi are letters from 
disjoint sets Qi (i = 1, . . . , I + 1), Ui is a reduced words in the alphabet Yj (i < I). 
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To define the composition S = Mo Z of M and Z(A), we will insert a p-letter between 
any two consecutive q- letters ki, A; i+1 in an admissible word of S(M), to be able to treat 
any subword ki...p...k i+1 as an admissible word for a copy of Z{A). In particular, the start 
and the accept words of S are obtained from, respectively, the start and accept words of 
M. 

First, for every % — 1, I, we make two copies of the alphabet Yj of S(M) (i = 1, /): 
Yifi = Yi and Y i X . The set of state letters of the new machine is 

K x U Pi U K 2 U P 2 U ... U?,U K l+1 , 

where P = { Pi , Pi (9, l-), Pi (6, 2-), Pi (6, 3~),p z (9, l + ), Pi (9, 2+), Pi (6, 3+) | 9 G 0+}, % = 
1,..,/. 

The set of state letters is 

y = (Y lfi u Y ltl ) uy,u (y 2)0 u y 2j1 ) u y 2 u ... u (y, u y u ) u y, ; 

the components of this union will be denoted by Yi, Y 2l . 

The set of positive rules G of M o Z is a union of the set of modified positive rules 
of S(M) and of positive rules of Zi(6, — ) + and Zi(9,+) + (9 G ©,« = 1,...,/) which are 
copies of the machines Z(Yi) (also suitably modified). 

More precisely, let a positive command 9 of S(M) differs from the unique start and 
accept commands of S(M) and have the form 

[fciiii ->■ k' 1 u' 1 ,v 1 k 2 u 2 -> uifcau'a, ...jUjfcj+i ->• 
where fcj, ^ G -K^, ttj and i>j are words in Yj. Then its copy in M o Z is 



= 



[fciiti -> fci«i,uipi(0,3 ) ->■ uipi(0, 1 + ), /c 2 M2 -> k' 2 u' 2 ,..., 



v lPl (9,S-)^v' lPl (9,l + ),ki +1 ^k' l+1 ] 

with Y 2 j_i((9) = Yj O (0) and Y 2 j(#) = for every i, in particular, the words u^Vi are 
rewritten here in alphabet Y i . 

Thus the modified rule from 9 G turns on / copies of the machine Z(A) (for different 
A's). 

Each machine — ) is a copy of the machine Z(Yi), where every rule r = [C/i — > 

V\, U 2 — > V 2 , U 3 — > V 3 \ is replaced by the rule of the form 



fi(9,-) 



U 1 ^V 1 ,U 2 ^V 2 ,U 3 ^V 3 , 

kj ->■ kj, Pj (9, 3~) -»Pj(0,3~),j = - 1, 

p s (0, 1~) ->• p s (6>, 1"), A; s+ i ->■ fc a+ i, s = i + 1, I 



where Ui, U 2 , U 3 , V±, V 2 , V 3 are obtained from Ui, U 2 , U 3 , Vi, V 2 , V 3 , respectively, by replac- 
ing p(j) with Pi(9,j~), L with ki and R with k i+i , and for s ^ i, Y 2s _ 1 (fj(0, — )) = Y ij0 . 

Similarly, each machine Zi(9,+) is a copy of the machine Z(Yj), where every rule 
r = [Ui — > Vi, C/ 2 — >■ V 2 , U 3 — )■ V 3 \ is replaced by the rule of the form 



Ps (9, 1+) ^> Ps (9, 1+), A£ +1 , s = i + l,...,l 



1, - 1, 
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where Ui, U 2 , U3, Vi, V 2 , V 3 are obtained from Ui, U 2 , U 3 , Vi, V 2 , V3, respectively, by replac- 
ing p(j) with Pi{0,j + ), L with k\ and R with k' i+1 , and for s 7^ i, Y 2s -i(fi(9, +)) = Y i . 

If 9 is the start (the accept) command of S(M), then we introduce only machines 
Zi(0,+) (only Zi(0,—), respectively), and replace the letters Pj(0,3~) (replace Pj(0, 1 + ), 
resp.) by pj in the above definition of the command 0. 

In addition, we need the following transition rules ((9, — ) and ((9, +) that transform 
all p- letters from and to their original. 

[hi -> ki,pj -4 Pj{9, = 1, ...,/ + 1, j = 1, 

[k[ k^ptf, 3+) -4 Pi , i = 1, Z + 1,3 = 1, /]. 

If 9 is the start (the accept) command of S(M), then we introduce only ((9, +) (only 
((9, — ), respectively). 

Thus while the machine — ) (the machine Zi(9, +)) works all other machines 
Zj(9,—) (all machines Zj(9,+), j 7^ z) must stay idle (their state letters do not change 
and do not move away from the corresponding fc-letters). After the machine Zi(9, — ) (the 
machine Zi(9, +)) finishes, i.e., the state letter Pi(0, 3^) appears next to k i+1 (next to k' i+1 ), 
the next machine Z^i(9, — ) (the machine Zi + i(9, +)) starts working. The transition rule 
((9, — ) switches on the consecutive works of the machines Z±(9, — ), . . . , Zi(9, — ). After 
all p-letters have the form pj(9,3~) we can apply the rule 9 and turn all pj(9,3~) into 
Pj(9, 1 + ). This switches on the consecutive work of Zi(9, +),... , Zi(9, +), followed by the 
transition rule ((9, +). 

Thus, in order to simulate a computation of the symmetric TM M (and of the S- 
machine S(M)) consisting of a sequence of applications of rules 9 X , 9 2 , 9 S , we first apply 
all rules corresponding to 9i, then all rules corresponding to 9 2 , then all rules corresponding 
to #3, etc. The language £$ of S consists of some words u in the alphabet Y\ q. In 
particular, every input admissible words of S is of the form E(it) = k\up\k 2 p 2 k 3 . . . kx-\p\k\. 
We denote by S = So(«S) the accept word of S. 

The modified rules 9 of S(M) will be called basic rules of the S"-machine S. 

2.6 Computations of machine S 

Given a computation C of S, one obtains a computation Cs(m) of the S(M) after omitting 
all the (copies of) commands of Z, deleting the additional state letters of Z from the 
admissible word of the computation C, and replacing the letters from alphabets Y^o by 
their copies in Yi (By Lemma [2761 (2), the computation Cs{m) is well-defined, although 
Cs M is of length if C has no basic rules.) 

Lemma 2.7. (1) The start and the accept rules of S are basic rules which are the copies 
of the start and the accept commands of M, respectively. The machine M o Z satisfies the 
Siq- condition. 

(2) If the history h of a reduced computation C : w —>••••—>• w t of S contains no 
basic commands, then \\wi\\ < max(| |u> | |, ||w t ||) for every % (0 < i < t). If the only basic 
letter of h is the last one, then \ \wi\ \ < ||wq|| fori < t. 

(3) If a computation C of S is reduced then Cs(m) is also reduced. 

(4) For any reduced computation C : Wq — >• • • • — )■ Wt of S starting and ending with 
basic rules we have spaces{C) = space $ (m) {Cs(M))- 
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(5) For every positive reduced computation C : iu —>••••—>• Wt 0/ £/ie machine S(M), 
there is a canonical reduced computation C$ of S whose history starts and ends with basic 
commands, such that (Cs)s(M) — C. Moreover, we have spaces(Cs) = spaces(M)iC). 

Proof. (1) Property (1) follows from the similar property of the machine M and from the 
definition of the machine M o Z 

(2) We start with the first statement. If the history of the whole computation consists of 
the commands of one machine Zi(9, ±), then the statement follows from Lemma [2.6( 3). 
Otherwise it has s > 1 admissible words , Wi s , such that the history of every 
subcomputation 

C : w ->■ >w h , Cj : ->■ > C s : w is ->■ > w t 

either consists of the commands of some Zi(9, ±), being a maximal subcomputation with 
this property, or has only one transition letter ((9, ±) for some 9. In the former case, 
all the admissible words participating in Cj have the same length for j 6 [1, s — 1] by 
Lemma [2761 (5.6). The same is clearly true in the latter case. Therefore it suffices to prove 
that the lengths of the admissible words do not decrease in the subcomputations Cq 1 and 
C s . 

Let us consider C s only, assuming that it is a computation of some machine Zi(9, ±). 
It corresponds to a computation LupvR = Wo —)■•-•—)■ W m = Lu'pv'R of a machine 
of the form Z(A) with m = t — i s . Since s > 1 and C s is a maximal subcomputation 
corresponding to Zi(9, ±), we must have \ \v\ \ = 0. (Otherwise only commands of Zi(9, ±) 
could be applied to Wi s , and so Wj s -i —> Wi s — > ••• — » w t were a longer computation 
of the same Zi(9,±).) Hence the projection of the word uv onto A is reducible, and so 
1 1 Wo 1 1 < 1 1 for every % G [1, m] by Lemma 12.61 (2). Therefore either all Wi-s have 
equal lengths or ||Wo|| < ||Wi|| < • • • < ||W m || by Lemma [2761 (3). In any case, we have 
1 1 Wo 1 1 < • • • < 1 1 W m | |. This implies | \w ia || < • • • < \\w t \\, as required. 

The proof of the second claim is similar: The computation w ix —)■•••—>■ w is is product 
of subcomputations subcomputations Cj-s which preserve the lengths of Wj-s, while C 
and C s cannot increase the space. 

(3) Assume that n ... Tt is a history of a computation Wq —>■■••—>• Wt of <S, where T\ 
corresponds to a positive command 9 of S(M), r t corresponds to 9~ 1 , and other rules are 
not basic. Then non-empty history of the computation w± — > ■ ■ ■ — > Wt-i is a product 
Hi . . . H s where Hi (i — 1, . . . , s) are maximal subwords corresponding to some Zj^(9i, ±). 
Since Hi and H s correspond to Zi(9, +), either s = 1 or there is i such that both i?i_i and 
H i+ i correspond to the same Zj^±i(9', ±). It follows that the computation with history 
Hi satisfies the assumption of Lemma [2.61 (6). a contradiction. 

(4) Property (4) follows from (2) and the definition of the computation Cs(m)- 

(5) Given C, the computation Cs with the same space and with property (Cs)s(M) — C 
is briefly described above at the end of subsection 12.51 and with more details (though with 
submachines Zi(9, +) but without Zi{9, —)) in subsection 3.7 of [22]. □ 

Lemma 2.8. Let C : Wq —)■•••—)■ w t be a reduced computation of S such that the first 
command 9i and the last commands 9 t are basic ones, and Cgm) — W) —>■-••—>■ W s with 
s > 2. Then the subcomputation Wi —>■•••—>■ W s _i is positive and t — 2 > 2^ Wl ^ a ' 1 . If 
9i (if @t) is a start (is an accept) command, then the word Wo (respectively, W t ) is also 
positive. 
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Proof. To justify the first claim of the lemma, it suffices to prove that the word W\ is 
positive under assumption that there are no basic commands in the computation W\ — > 
•••—>■ w t -i, and the word W\ corresponds to w\. Therefore it suffices to prove that the 
word w\ is positive. 

We first assume that the first command 8 of the history of C is a positive basic com- 
mand. Then 8 switches on the machine Zi(8, +). Since s > 2, this machine must complete 
its work before the computation C ends. The computation of Zi(8, +) cannot be empty 
since otherwise 9 were followed by 9~ 1 because W\ involves the state letter p±(9, 1 + ). But 
this would contradict to the reducibility of C. 

Hence, by Lemma 1231 (6). the work of Zi(8, +) soon or later leads to an admissible word 
Wi containing the state letter pi(8, 3 + ). By Lemma [2.6( 5). the subword of w\ of the form 
kiUxPx(8, l + )fc2 is positive, and the time of the work of Z\(8,+) is at least 2^ U1 ^. Then 
the machine Z±(8, +) ends working and switches on the machine Zi{8, +), whose work 
similarly provides the positiveness of the subword of W\ having form k2ii2P2{8, l+)ks. 
Finally we obtain that w\ is covered by positive subwords, and so it is positive itself. 
Besides the time of work of all Z x (8, +),..., Zi(9, +) is at least 2^^ 1 since |Wi| a = 

If the command 8 1 is positive, then it switches on the machine Zi(9, — ), and we first 
obtain the positiveness of kiUiPi(8,3~)ki + i, then the positiveness of hi-iUi-iPi-i(8,3~)ki, 
an so on. 

Since start and accept commands leave tape letters unchanged, the second statement 
follows from the positiveness of the word W% (of the word Wt-i)- The lemma is proved. □ 

Lemma 2.9. (1) The S-machine S and the symmetric NTM M recognize the same lan- 
guage C. 

(2) The space functions Sm(p) and Ss{n) of M andS, respectively, are equivalent. 

(3) The generalized space functions S' M (n) and S' s (n) of M and S are equivalent. 

(4) We haveT '(n) >z ex P(<S , M( n )) for the generalized time function T'(n) of the machine 

S. 

Proof. (1) Assume that a word u belongs to the language £ recognizing by M. By 
Lemma I2T5] this word belongs to the language of S(M), and the accepted computation 
C is positive. By Lemma I2TT 5). u belongs to the language of C$ of S. 

Now suppose u belongs to Cs, and C is the accepted computation. Then the computa- 
tion Cs(m) is positive by Lemma |2~8| and therefore this computation is also an accepted 
computation of the machine M by Lemma I2.5[ and so u e L. 

(2) The above argument shows that if a reduced computation C : wq —)■•••—)■ w t of S 
accepts an input admissible word w , then the computation C${m) is a positive accepted 
computation of both S(M) and M, and so 5m(w) < Ss(n) by Lemma [2T71 (4). On the 
other hand, every accepted input configuration W of M is accepted by S(M). By Lemma 
12.7( 5). it has a copy accepted by S, and moreover, the accepting computations of M and 
S need the same space. Therefore Sm(ji) > Ss{n). 

(3) Assume that C : W = W Wt is an accepting computation of M such 
that spaceM{C) = spaceM{W), and Cs '■ w = wo — > ■ ■ ■ —> w s . For given w and w s , we 
also consider a reduced computation C : w — > ■ ■ ■ — > w s of S with minimal space and 
the computation (C') S (m) '■ Wq ->••■•—>■ W(,. If f = (no basic rules), then W = W t 
by Lemma 1231 (2), and so spaceM{C) = spaces (C). Then we assume that t' > and 
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note that \\Wq\\ = \\W \\ and HW^H = ||Wt|| by Lemma 12761 (2) since the corresponding 
admissible words of S can be connected by computations without basic rules. Since C' S , M \ 
is positive by Lemma |2~8| it is also an accepted computation of the machine M by Lemma 
12.51 Therefore, by Lemma 12771 (4) . space m(C) < space s(M){C'sfM)) = spaces (C). Since 
\w\ a = \W\ a , the last inequality proves that S' M {n) < S' s (n) for every n. 

Now we consider any accepting computation C : w = wq —)■•-•—)■ w t of S with 
\w\ < n, such that spaces(C) = spaces (w). Without loss of generality, we may also 
assume that spaces (C[m\) = spaces (w m ) for every sub computation C[m] of the form 
w m ~^ ■ ■ • — > w t . Let 9i i: . . . , 9 is be the basic commands of the history h = 6\ . . . 6 t of 
C, C be the subcomputation of C with history h! — 9\ . . - O^-i, and C" have history 
h" = 9 h ... 9 t - and so h = h'h" and C = C'C". 

We denote by W — >■ W% ->••••—>■ W s the computation (C") S {m) and by (C") S (m)[1] 
the subcomputation W\ —)■•••—)■ W s , which is positive by Lemma 12.81 Note that 
space s(m)((C")s(m)[^]) = spaces(M)(Wi) since otherwise the subcomputation of C[ii] 
could be replaced by a subcomputation which needs less space by lemmas (5). There- 
fore by Lemma 12.71 (4) , 

space s {C") = space s{M ){{C") S (M)) < space 5(M )((C" / )5(Af)[l]) + |||W / i||-||W ; o||| = space S { M ) 

since |||Wi|| — ||Wo||| is bounded by a constant c depending on the machine M only. 

By Lemma I2TT1 (2). we have ||tUj|| < \\wq\\ for j < ii, and so spaces(C) < \wo\ a . Now, 
since ||Wi|| < ||Wo|| + c < ll^n-ill + c < ll w o|| + c ; we g e t 

spaces{C) < ma,x(spaces(C), spaces (C")) < 
max(|w | a , space S ( M )(W 1 ) + c) < max(|w | a , S' s{M) (\w \ a + c) + c 

Hence S' s {n) < ma 1 x(S' s , M Jn + c) + c,n). This inequality together with the inequality 
S' M {n) < S' s {n) obtained earlier, show that S' M (n) ~ S' s (n). 

(4) Let again C : W = Wq —>■•••—>■ W t be an accepting computation of M such that 
spaceM^C) = spaceM{W) = S"(|W| a ), and C$ '■ w = w — > ■ ■ ■ — > w s . For given w and 
w s , we also consider a reduced computation C : w —>•••—)■ w s of <S with minimal tzme s 
and the computation (C")s(M) : W 7 ^ W 7 /, , where one has H 7 ^ = W and W 7 /, = W t . 

Therefore (C')s(m) is a positive computation by Lemma |2~8| and by the choice of C and 
Lemma |275| space m(C) < spaceM{{C')s{M))- Consider also a maximal subcomputation 
C" of C starting and ending with basic commands. Then (C')s(m) — (C")s(m)- By 
Lemma [2781 space s(M){C") s(m) < c\og(times(C")) for a constant c > 0. Thus by Lemma 
[231 

^'(l^la) = space M (C) < space M {{C') S (M)) = space s(m){{C') S (m)) 
= spaces { M){{C")s {M) ) < c\og{time s {C")) < clog(time s (C')) < clog(T'(|W| a )), 
and the lemma is proved. 

□ 

Lemma 2.10. (a) For every DTM M recognizing a language C and having a space func- 
tion Sin), (b) for every NTM M recognizing a language C and having an FSC space 
function f(n), there exists an S -machine S with the following properties. 

1. The machine S recognizes the language C 
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2. Respectively, (a) both the space and the generalized space functions ofS are equivalent 
to S(n), (b) both the space and the generalized space functions of S are equivalent to 

f(n) 2 )- 

3. Every command of S or its inverse inserts/ deletes at most one letter on the left and 
at most one letter on the right of every state letter. 

4- The machine S satisfies the S\Q-property. 

5. The unique start command is of the form qi — > q[,q2 —> q'2, ■ ■ ■ > <lk+i ~> Qh+u where 
(q u . . . , gjfe+i) = si- 

6. Any state letters q from Q\ is passive, i.e., there are no commands of S of the form 
q q'u with non-empty a-word u. 

Proof. For every DTM M recognizing a language C and having a space function S(n) 
(for every NTM M recognizing a language £ and having an FSC space function f(n)) 
one can construct a symmetric NTM M' , as in the formulation of Lemma [2.41 Then the 
properties 1 and 2 hold for the machine S = M' o Z by Lemma 12.91 

To provide the third property we use the same trick as for property 4 of Lemma 12.41 
The properties (1) and (2) are obviously preserved. 

To obtain the Si -condition of S, it suffices to add new state letters and two new 
(positive) commands. Such a modification preserves the other properties of S, as this was 
noticed at the end of subsection 12.11 Now the s*io-condition for S follows from Lemma 12.71 
(1). The form of the start command follows from our agreement that all tapes number 
2, . . . , k are empty for the input configurations of the machine M. Finally, the left-most 
head of the Turing machine M is passive being equal to the separating symbol cti; and the 
same property is inherited by S as this follows from the definition of composition M' o Z. 

□ 

3 Groups and diagrams 
3.1 Construction of embeddings 

Let if be a finitely generated group with solvable word problem. To prove Theorem 11.21 
we will suppose that a Turing machine M solves the word problem in H. This implies that 
H has a finite set of generators {a\, ..,a m }, and a word w in generators aj-s is accepted 
by M iff w — 1 in H. Here we consider only positive words in the generators since M can 
work with positive words only, and so we assume that the set of generator is symmetric: 
for every a«, there is a generator aj such that a^aj — 1 in H. Thus every relations of H 
follows from relations in a 1; . . . , a m , with positive left-hand side. 

Further we will assume that one of the hypotheses (a), (b) of Lemma 12.101 holds. 
Therefore we also have the S'-machine S = (X, Y, Q, 0, Sq) provided by that Lemma. 
The input alphabet of S is the system of generators ai, . . . ,a m of the group H together 
with the symbols of the inverse letters {a^ 1 , ..,a^}. Let S' s (n) be the generalized space 
function of S. 

We denote by <S = (X, Y, Q, Q,~hi,~so) a copy of the S'-machine S. We will assume that 
X = X, Y HY = X, Q H Q consists of the state letters of the vectors s\, and s*o, and 
H = 0. Therefore the machines S and S have the same input admissible words. 
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The copy of a command 9 of S is called 9. Similar notation is used for the a-letters 
from Y and g-letters from Q. The set of rules of the machine and admissible words of 
S U iS is by definition, the union of the corresponding sets for S and for S. The following 
lemma is a clear consequence of these definitions. 

Lemma 3.1. The sets of the accepted input admissible words of the machines S, S and 
SUS coincide, and so these machines recognize the same language C. They also have equal 
space functions and equal generalized space functions. For every accepting computation 
Wi — y . . . of S U <S, there is an accepting computation W\ — > . . . of either S or S whose 
length and space does not exceed the length and space of the original computation. 

We consider a group G(S,L)) associated with the machine S. Furthermore as in |21j . 
we need a very similar group G(S, L) to produce a group embedding required for the 
proof of Theorem 11.21 

To define G(S,L)) we need many copies of every letter used in the work of S; this 
enables to apply a kind of hyperbolic argument for the hub structure of van Kampen 
diagrams. Moreover, the copies alternate with "mirror copies"; this trick works in Section 

n r 

Therefore we "multiply" the machine S as follows. For some even L > 40, we introduce 
L/2 copies S = S\,S 3 , . . . of the machine S and L/2 mirror copies iS> 2 ,iS>4, • • • ,<Sl 

of S (A mirror copy of an arbitrary word X\ . . . x n is, by definition, x n ...X\. For even 
i, the rules of <Sj(L) transform the words in the mirror manner in comparison with S.) 
We also add auxiliary separating state letters k\,...ki,. For every admissible word W 
of the machine S, we define the words W = W\, W 2 , ■ ■ ■ , Wl, where Wi, W 3 , . . . Wl-\ 
are copies of W in disjoint alphabets and W2, W4, Wl are mirror copies of W also in 
disjoint alphabets. The words of the form k{Wik2W2 ■ ■ ■ kiWi are admissible word of the 
machine S(L). The rules of S(L) are in one-to-one correspondence with the rules of S; 
they transform the words W\, W3, ... as the commands of S, and transform the words 
W 2 , W4, ... in the mirror manner, and they do not change the new state letters ki,...,kz,. 
We identify the set of rules of S(L) with 6. Thus, by definition of S(L), we also manifold 
the input and accept configurations. If u is an input word for S, then we denote by 
S(m,L) the corresponding input configuration for S(L) containing L/2 copies of u and 
L/2 mirror copies of u as subwords. If the admissible words of S have K state letters, 
then the admissible words of the machine S(L) have iV = (K + 1)L state letters. Clearly 
the machine S(L) enjoys the properties (l)-(5) of machine S listed in Lemma [2.101 

The finite set of generators of the group G(S, L) consists of q-letters corresponding 
to the states of S(L), a-letters corresponding to the tape letters of S(L), and 9 -letters 
corresponding to the commands. Thus the set of generators consists of the set of (state) 
g-letters Q(S, L) = uf =1 Q h the set of (tape) a-letters Y = Uf =1 Yi including X = Uf =1 X h 
and the ^-letters from iV copies of G + , i.e., for every 9 G + , we have N generators 
9i, . . . , 9 N . 

The relations of the group G(S,L) correspond to the rules of the machine S(L); for 
every 9 = [Ut ->• V\ , . . . U N ->• V N \ G 6 + , we have 

UA+i^OiVi, 9 j a = a9 j , i,j = l,...,N (3.4) 

for all a G Yj{9). (Here 9n+i = 9\. ) The first type of relations will be called (9,q)- 
relations, the second type - (9, a)-relations. 
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The definition of machine S(L) is similar to that of S(L) but the admissible words 
are of the form k\W\k2W2---kjJW l, where every W{ is obtained from Wi after replacement 
of every letter x by its copy x, and for i = 1, we, in addition, delete all a- letters, i.e., 
the word W\ has no a-letters. (In other words, instead of the first copy of S, we use the 
"machine" with the same state letters but having no tape letters.) In particular, the word 
£(«, L) is obtained from £(w, L) by omitting of the first occurrence of the input word u. 
Again, it is obvious that properties (1) - (5) from Lemma \2. 101 hold for the machine <S(L) 
as well. The relations of the group G(S, L) are 

uA+i = 0iVi, i = l,...,N, 6,0, = a9 3 (3.5) 

for all a G Yj(9) and j 6 [K + 1, N]. 

We will also use the combined ^-machine S(L) US(L). Its admissible words are either 
the admissible words for S(L) or the admissible words for S(L), the set of rules is the 
union of rules for S(L) and S(L). Note that the machine S(L) US(L) does not satisfy the 
Sio-condition. 

The subwords of the form (kiWik^i)^ (indexes modulo L) of the admissible words of 
the machine S(L) are said to be an i-sector words. Similarly one define z-sector words for 
the machines S(L) and for S(L) US(L). (Recall that the ^-letters of the 1-sector words of 
S(L), except for k\ and k%, are identified with the corresponding letters of the original S- 
machine S.) The state letters of the i-sector of S(L) are the letters from ^j=a-l)(K+i)+iQj' 
and the tape letters of the i-sector are the letters from ^j=u-l)(K+i)+i^r Similarly we have 
state and tape letters of the i-sector for S(L) and for S(L) U S(L). By definition, the 
^-letters 9j and 9j with subscripts j E [(i — l)(K + 1) + l,i(K + 1)] are ^-letters of the 
i-sector of S(L) U S(L). The state, tape and the 0- letters of the i-sector constitute the 
alphabet At of the z-sector. 

The group G(S US, L) is given by the generators and relations of both groups G(S, L) 
and G(S,L). 

Finally, the required group G is given by the generators and relations of the group 
G(S U S, L) and one more additional relation, namely the hub-relation 

S = 1, (3.6) 

where S = S(L) is the accept word (of length N) of the machine S(L) (and of S(L) as 
well). 

Suppose an admissible word W of S(L) is obtained from an admissible word W 
by an application of a rule 6 : [Ui — > Vi,...,Un — > Vat]- This implies that W = 
U1W1U2W2 ■ ■ ■ UnWn, where Wi is a word in the alphabet and therefore W = Q^WQi 

in G(S,L) by relations (13.41) . since Q\ = 9n+i- 

Now suppose u is a positive word in the alphabet {ax, . . . , a m } vanishing in the group 
H. Then it is recognized by machine S, and so the word £(«) is accepted by S(L) and 
therefore it is conjugate of S in the group S(L) US(L). Consequently, we have £(«) = 1 
in G by (13.61) . Similarly, £(«) = 1 in G. 

Recall that we identified the alphabet of 1-sector words of S (L) with the alphabet of S. 
Therefore the word £(«) results from E(it) after deleting the subword u in the alphabet 
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of generators of H. Hence the relations S(u) = = 1 imply u — 1 in G. Since the 

language of accepted words for S(L) contains all the defining relations of H, we have 
obtained 

Lemma 3.2. The mapping ai (—)• (i — 1, . . . , ai) induces a homomorphism of the group 
H toG. 

In Section H] we show that this homomorphism is injective. 
3.2 Minimal diagrams 

As in [19], we enlarge the set of defining relations of the group G by adding some con- 
sequences of defining relations. Taking into account Lemma [3 .2\ we include all cyclically 
reduced relations of the group H generated by the set {ai, . . . ,a m }, i.e., all non-empty 
cyclically reduced words in {af 1 ,. . . , a^ 1 } which are equal to 1 in the group H. These 
relations will be called H-relations. 

We denote by Gi the group given by all generators of the group G, by all if -relations, 
and by all defining relations of G except for the hub- relation (13. 6p . 

Recall that a van Kampen diagram A over a presentation P = (B | 1Z) (or just over the 
group P) is a finite oriented connected and simply-connected planar 2-complex endowed 
with a labeling function <fi : E(A) — > B^ 1 , where E(A) denotes the set of oriented edges 
of A, such that 0(e _1 ) = 0(e) -1 . Given a cell II of A, we denote by dU the boundary 
of II; similarly, <9A denotes the boundary of A. The labels of 911 and <9A are defined up 
to cyclic permutations. An additional requirement is that the label of any cell II of A is 
equal to (a cyclic permutation of) a word R , where R G 71. Labels and lengths of paths 
are defined as for Cayley graphs. 

The van Kampen Lemma states that a word W over the alphabet B represents 
the identity in the group P if and only if there exists a diagram A over P such that 
4>(dA) = W ([T5], Ch. 5, Theorem 1.1). 

We will study diagrams over the groups G and G\. The edges labeled by state letters 
( = q-letters) will be called q-edges, the edges labeled by tape letters (= a-letters) will be 
called a-edges, and the edges labeled by the letters from 9 and (= Q-letters) are Q-edges. 
The cells corresponding to the relation (I3.6P are called hubs, the cells corresponding to the 
relation (13.41) and (13.51) are called (Q,q)-cells if they involve g-letters, and they are called 
(Q, a)-cells otherwise. The cells corresponding to arbitrary relations of H are H-cells. 

The obtained presentation and diagrams over it are graded by the ranks of defining 
words and cells as follows. The hubs are the cells of the highest rank, the rank of (Q, q)- 
cells is higher than the rank of if-cells (and in Lemma [5. 1[ (Q, fcj)-cells, with i ^ 1, 2 are 
higher than other (Q, g)-cells), and the (Q, a)-cells are of the lowest rank. 

If A and A' are diagrams over G, then we say that A has a higher type than A' if A 
has more hubs, or the numbers of hubs are the same, but A has more cells which are next 
in the hierarchy, and so on. 

Clearly the defined partial order on the set of diagrams satisfies the descending chain 
condition, and so there is a diagram having the smallest type among all diagrams with 
the same boundary label. Such a diagram is called minimal. 
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3.3 Bands and trapezia 

From now on, we shall mainly consider minimal van Kampen diagrams. In particular the 
diagrams are reduced, i.e., they do not contain cells that have a common edge and are 
mirror images of each other. To study van Kampen diagrams over the groups G and G\ 
we shall use bands and trapezia as in [25] , [I]. 
Here we repeat some necessary definitions. 

Definition 3.3. Let Z be a subset of the set of generators X of the group G. An 2-band 
B is a sequence of cells 7Ti, 7r n in a van Kampen diagram such that 

• Each two consecutive cells 7Tj and 7Tj + i in this sequence have a common edge labeled 
by a letter from Z. 

• Each cell 7Tj, i — 1, ...,n has exactly two Z-edges, ej_i and (i.e. edges labeled by 
a letter from Z). 

• If n = 0, then i3 is just an Z-edge. 

The counterclockwise boundary of the subdiagram formed by the cells 7Ti, ...,7r n of i3 
has the factorization e^qifq^ 1 where e = e is an Z-edge of tti, / = e n is an Z-edge of 7r n . 
We call (ft the bottom of £> and g 2 the top of B, denoted bot(B) and top(£>). Top/bottom 
paths and their inverses are also called the sides of the band. The Z-edges e and / are 
called the start and end edges of the band. If n > 1 but e = /, then the Z-band is called 
an Z-annulus. 

We say that an J?i-band and an i^-band cross if they have a common cell and Z\V\Zi = 

0. 

We shall call an Z-b&nd maximal if it is not contained in any other ^-band. 

We will consider g-bands where Z is one of the sets Qi of state letters for the machine 
S(L) U S(L), 0-bands for every 6 G 0, and a-bands where M = {a} C Y . 

The convention is that a-bands do not contain g-cells, and so they consist of (9, a)-cells 
only. 

The papers [18] , [1], [21] contain the proof of the following lemma in more general 
setting. (In contrast to lemmas 6.1 [18] and 3.11 [21], we have no x-cells here.) 

Lemma 3.4. A minimal van Kampen diagram A over G\ has no q-annuli, no 9-annuli, 
and no a-annuli. Every 6-band of A shares at most one cell with any q-band and with any 
a-band. 

If W = x\...x n is a word in an alphabet X, Y is another alphabet, and 0: X — > YU{1} 
(where 1 is the empty word) is a map, then 4>{W) = 0(xi)...0(x n ) is called the projection 
of W onto Y. We shall consider the projections of words in the generators of G onto 
6U0 (all ^-letters map to the corresponding element of G U 6, all other letters map 
to 1), and the projection onto the alphabet {Q\ U • • • U Q N } (every q- letter maps to the 
corresponding Qi, other letters map to 1). 

Definition 3.5. The projection of the label of a side of a g-band onto the alphabet Q ±x 
is called the history of the band. The projection of the label of a side of a #-band onto the 
alphabet {Qi, ...,Q n } is called the base of the band. Similarly we can define the history 
of a word and the base of a word. The base of a word W is denoted by base(W). It will 
be convenient instead of letters Q±, ...,Qn, in base words, to use representatives of these 
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sets. For example, if k G Qi, q G Q2; we shall say that the word /cag has base /eg instead 
of QiQ 2 - 

Definition 3.6. Let A be a minimal van Kampen diagram over G\ which has the contour 
of the form p± q\p 2 q 2 where: 

(TRi) pi and p 2 are sides of g-bands, 

(TR 2 ) qi, q 2 are maximal parts of the sides of #-bands such that <fi(qi), 4>{q 2 ) start and 
end with g-letters, 

(TR3) for every #-band T in A, the labels of top(T) and bot(T) are reduced. 




Then A is called a trapezium. The path q\ is called the bottom, the path q 2 is called 
the top of the trapezium, the paths p\ and p 2 are called the left and right sides of the 
trapezium. The history of the g-band whose side is p 2 is called the history of the trapezium; 
the length of the history is called the height of the trapezium. The base of gi is called the 
base of the trapezium. 

Remark 3.7. (1) Property {TR3) is easy to achieve: by folding edges with the same 
labels having the same initial vertex, one can make the boundary label of a subdiagram 
in a van Kampen diagram reduced, see [25] . 

(2) Notice that the top (bottom) side of a #-band T does not necessarily coincide 
with the top (bottom) side q 2 (side gi) of the corresponding trapezium of height 1, and 
q 2 (gi) is obtained from top(T) (resp. bot(T)) by trimming a first and last a-edges if 
these paths start or/and end with a-edges. We shall denote the trimmed top and bottom 
sides of T by ttop(T) and tbot(T). By definition, for arbitrary #-band T, ttop(T) is 
obtained by such a trimming only if T starts or/and ends with a (8, g)-cell; otherwise 
ttop(T) = top(T). The definition of tbot(T) is similar. 

The trapezium A is said to be a z-sector if the labels of top and bottom paths are 
i-sector words. 

Lemma 3.8. Let T be an i-th sector, where i 7^ 1. Then the sides p\ and p 2 are the sides 
of maximal kt- and k i+ i-bands /Q and ofT, respectively. If an edge e ofT, belongs 
to neither /Q nor /Q + i, then 0(e) G Ai. In particular, T has no H -cells. 

Proof. The first assertion follows from Lemma \'3 .41 since the labels of the top and bottom 
of an z-sector are of the form ki . . . k i+ i. Then, by the same lemma, the second assertion 
is true for the edges of all #-cells since every maximal #-band must connect /Q and /Q+i. 
Since j ^ 1 and the labels of the boundary edges of if-cells belong to A\, the if -cells of 
T have no edges in common either with the #-cells of V or with the boundary BY. Now 
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the minimality of the diagram T implies that V has no i^-cells at all, and so the lemma 
is proved. □ 



The following lemma claims that every i-sector, i ^ 1, simulates the work of S(L) U 
S(L). It summarizes the assertions of lemmas 6.1, 6.3, 6.9, and 6.16 from [21]. For the 
formulation (1) below, it is important that S (and S U S) is an S"-machine. The analog 
of this statement is false for Turing machines. (See [IH] for discussion.) 

Lemma 3.9. (1) Let A be an i- sector for some i^l with history 9\ . . .9$. Assume that 
A has consecutive maximal 8-bands 71, . . . Ta, and kiWjki + \, and kjW'jki+x are the bottom 
and the top labels of Tj, (j = 1, • . . , d). Let Uj (resp. Vj, i = 1, . . . d) be the copies of Wj 
(resp. Wj) in the alphabet of the machine S U S. Then Uj, Vj are admissible words for 
5U5, and 

v 1 = u 1 o6 u u 2 = v 1 ,...,u d = v d ^v d = u d oe d 

(2) For every reduced computation U o h = V of S (of S) with \h\ > 1 and for every 
i G [1,N] (for every i G [2, N]), there exists an i-sector A with history h and without 
H-cells, whose bottom and top labels are kiU'ki + \ and kiV'k i+ i, where U' (resp. V) is the 
copy or the mirror copy of the word U (ofV)in the alphabet Ai. These copies are mirror 
ones iff i is even. 

□ 

We call an i-sector accepted if the top label kiV'ki + \ of it is the i-sector subword of the 
word S . For i ^ 1, the computation V\ Vj of the machine 5U"5 provided by 

Lemma [3791 (1) for an accepted i sector is accepting. 

3.4 Replicas 

Let A be an i-sector, where j ^ 1. Then by Lemma 13.81 f° r every i' ^ 1, one can relabel 
the edges of A (or of the mirror copy of A if i — i' is odd) and obtain an i'-sector A', which 
is just a copy (or a mirror copy) of A. But one cannot construct such a copy if i' = 1 since 
there are no commutativity relations 6jd = adj if 9j G Ai (see (13. 5p ). However one can 
construct an ersatz-copy of A called replica if A is an accepted sector. This construction 
will be used in Section 5. 

For i' — 1, we construct the replica of A as follows. (We assume below that i is odd, 
otherwise one first replaces A by its mirror copy.) 

At first, the relations (13 .4p and (13 .5p make possible to replace the maximal /cj-band /Q 
and /cj + i-band of A by their copies /Ci and /C 2 , respectively. Similarly, we replace 
every maximal #-band of A by its copy if 9 is a command of the machine S. Now let T be 
a maximal #-band of A and 9 a command of the machine S. This T consists of (9, q)- and 
(9, a)-cells. To construct the replica T' of T we take the 'copies' of (9, g)-cells only (but 
no a-edges in these 'copies', the a-edges are contracted to vertexes !) and build T' from 
them identifying #-edges of neighbor cells in the order in which the original (9, g)-cells 
appear in T. 

It remains to close up the holes between 6*-bands T' s and T' s +i for consecutive T s and 
T s+ \. If the corresponding letters 9 S and 9 s+ i are both the commands of S, then we just 
identify the top of the copy T' s and the bottom of the copy T' s +i- This is possible since 
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4>(topT s ) = 0(bot7^+i) by Lemma \3. 91 (1). Similar identification works if 9 S and 9 S+1 are 
both the commands of S. 

Assume now that 9 S is a command of S and 9 s+ \ is a command of S (or vice versa). 
Then 0(top(7^)) = 0(bot7^ + i)) = kiW s k i+ i, and the copy W' s of W s in the alphabet A± 
is an admissible word by both machines S and S since both commands 9' 1 and 9 s+ i are 
applicable to it. But the only common tape letters of these machines are the letters of the 
input alphabet of S, and the only common state letters of these machines are the letters 
from the start vector s\ and the accept vector s*o of S. Then by property (5) of Lemma 
12.101 9~ l is either the start rule or the accept rule of S. 

In the latter case, we have 0(top(T' s )) = 0(botT' s +i)) = kiW s k i+ i since the accept 
words have no a-letters at all, and so the identification of the top of the copy T' s and the 
bottom of the copy T' s +i is possible again. Let us consider the former case. Then the 

rule 9~ l is of the form q± — > q[, q 2 — > q' 2 , . . . , qx — > q'x > where (q±, . . . , qx) = s\. Therefore 
W' s can have tape letters only between q 1 and q 2 , i.e., W' s = qiuq 2 q 3 ■ ■ ■ qx, where u is a 
word in the input alphabet. 

Since the sector A is accepted and the rules of S'-machines are invertible, we have that 
W' s is accepted by the machine S(L) U S(L), and so it is accepted by S by Lemma I3TT1 
Hence the word u belongs to the language recognized by the machine S, and therefore u 
is a word in the generators of H, and u = 1 in H by the definition of that language. 

Now, to close up the hole between the bands T' s and T' s +i, it suffices to paste in 
an H-cell tt labeled by the cyclically reduced form of the word u between them because 
0(top(T a )) = k x q x uq 2 qz ■ ■ ■ qxk 2 and 0(botT s+ i) = k x q x q 2 q z . . . q K k 2 . 




Replica 

The replica A' of the accepted i-sector A is constructed. To summarize our effort: We 
cut off the (9, a)-cell and contract up the a-edges of (9, g)-cells for every command 9 of 
the machine S from A, replace the maximal fc r and fcj+i-bands by their k\- and k 2 -copies, 
replace all the labels of the remaining edges by their copies from the alphabet Ai, and 
then close up all the holes by pasting in several if -cells. The result is the replica A', 
canonically obtained above. 

Remark 3.10. (1) The z-sector A is a union of K + 1 subsectors r 1; . . . , T^+i, where 
every Tj is a trapezia with a base of length 2 and with height equal to the height of A. 
The subsector Tj has a common maximal g-band C = Cj+i with Tj + \ for j = 1, . . . , K. If 
i is odd (even), then T 2 (resp., IV ) is the input subsector. When we construct a replica, 
if-cells appear only in the input subsector of the replica. 
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Every Tj has its own replica 1^-, which is a subsector of A'. Similarly, every q- or 6*-edge 
of C has a replica in the replica C of C in A'. Also every vertex o of C belongs to either 
a g-edge or a #-edge since the boundary of a (9, g)-cell has no two consecutive a-edges by 
Lemma [2.10( 5); and so o has a replica o'. 

(2) The replica of an i-sector is not necessarily a minimal diagram. 

3.5 Discs 

A disc diagram (or a disc) is a (sub)diagram A such that (1) it has exactly one hub II (2) 
there are no #-edges on the boundary <9A (3) There are no if -cells of A having an edge 
on dA. In particular, a hub is a disc diagram. 

Let us consider a disc diagram A with a hub IT. Denote by JCi, . . . , fCi the 
bands starting on the hub II. Since the hub relation has only one letter ki for every i, 
these /c-bands have to end on <9A. It therefore follows from Lemma 13.41 that for every i, 
the bands Ki and Ki+\ bound, together with the OA and (911, either a subdiagram 
having no cells corresponding to any non-trivial relation of G (in this case the bands Ki 
and /Q+i are also trivial), or is a trapezium, and the boundary label of A has exactly 
one letter from Qi for every i G [1 , . . . , N] . 

Similarly, consider two hubs Hi and Il 2 in a minimal diagram, connected by /c r band 
/Cj and /c i+1 -band /Q+i, where (i, i + 1) ^ (1,2), and there are no other hubs between 
these /c-bands. These bands together with the dUi and dU 2 , bound either a subdiagram 

having no cells, or is a trapezium. The former case is impossible since in this case 
the hubs have a common /c^-edge and they are mirror copies of each other contrary to the 
reducibility of minimal diagrams. We want to show that the latter case is not possible 
too. 

Indeed, in the latter case ^i is an accepted trapezium since the i-sector subword of E 
is kiwki+i, where w is a (mirror) copy of the accept word of the machine S. Therefore, 
according to Subsection 13.41 a replica of (as well as the (mirror) copies ^ for every 
j G [2, . . . , N]) can be constructed. Then one can construct a spherical diagram T from 
III, n 2 , and the diagrams (j = 1, . . . , N). There are two subdiagrams T' and T" of 
T with common boundary: T', being a copy of a subdiagram of the original diagram, 
is made of 111, II2, and \& = and T" is a union of all ^-s with j ^ i. Hence the 
subdiagram T' of the original diagram could be replaced by diagram of lower type with 
the same boundary label because T" has no hubs. This contradicts the minimality of the 
original diagram. 

Thus, any two hubs of a minimal diagram are connected by at most two /c-bands, such 
that the subdiagram bounded by them contain no other hubs. This property makes the 
hub graph of a minimal diagram (where maximal fc-bands play the role of edges connecting 
hubs) hyperbolic (in a sense) since the degree L of every vertex (=hub) is high (> 40). 
Below we give more precise formulation (proved for diagrams with such hub graph, in 
particular, in [18], Lemma 3.2). 

Lemma 3.11. If a minimal diagram over the group G contains a least one hub, then there 
is a hub IT in A such that L — 3 consecutive maximal k-bands £>i, . . . Bl-3 start on IT, end 
on the boundary dA, and for any i G [1, L — 4], there are no discs in the subdiagram Tj 
bounded by Bi, Bi+i, Oil, and dA. □ 

Corollary 3.1. The canonical homomorphism H — )■ G given by Lemma WJ^ is injective. 
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Proof. Assume that a word w in the generators ai, a m of the group H is equal to 1 in 
G. Then by van Kampen's Lemma, there is a minimal diagram A over G whose boundary 
label is w. Since w has no g-letters, A has no hubs by Lemma 13.111 By Lemma 13. 4[ A 
contains neither q- nor #-annuli, and so it has neither (9, g)-cells nor (9, a)-cells, because 
w has neither q- nor ^-letters. Hence this diagram can contain if -cells only. Since the 
boundary labels of if -cells are trivial in H, the boundary label w is trivial in H too by 
van Kampen's lemma, and so the homomorphism is injective. □ 



4 Comparison of paths in diagrams 
4.1 Paths in sectors 

We will modify the length function on the words in the generators of the group G. This 
modification is helpful in subsequent subsections. 

The standard length 1 1 * 1 1 of a word (of a path) will be called its combinatorial length. 
From now on we use the word length for the modified length. We set the length of every 
g-letter equal 1, and the length of every a-letter equal a small enough number 5 > so 
that 

5 < (3N)-\ (4.7) 
If a word v has s ^-letters, t a-letters, and no g-letters, then 

\v\ — s + 8 max(0, t — s) 

by definition. For example, the word read between two g-letters of a (q, #)-relation has 
length 1 since it has one ^-letter and at most one a-letter by formulas (13. 4[ 13. 5j) and the 
property (3) of Lemma 12.101 

Arbitrary word w is a product t>o<?iWig2 • • • Qm,v m , where qi, ■ ■ ■ ,q m are g-letters and the 
words t>o, • • .v m have no g-letters. Then, by definition, \w\ = m + X]j=o ^ e l en ffth 
of a path in a diagram is the length of its label. The perimeter \dA\ of a van Kampen 
diagram is similarly defined by a shortest cyclic decompositions of the boundary dA. It 
follows from this definition that for any product s = S1S2 of two words or paths, we have 
l s l < \si\ + I S2 1 , and \s\ = \si\ + | S2 1 if S2 starts or si ends with a g-letter. 

If a path p starts at a vertex o and ends at o', we will write o = p_ and d = p+. 
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Lemma 4.1. Let A be a trapezium bounded by two maximal q-bands C and C and having 
no H -cells. Assume that C has no a-edges. Let o be vertex lying on both C and the top of 
the trapezium A, and d belong C and the bottom of A. Assume that a path t connects o 
and d and has no q-edges. Then the vertexes o and d can be connected in A by a path 
t' such that \t'\ < \t\ and t' = tit 2 tz where t\ and t 3 are parts of the sides of C and C , 
respectively, and t 2 consists only of a-edges. 

Proof. Since the path t has no g-edges, it follows from the assumption of the lemma that 
every maximal #-band T of A has exactly two (9, g)-cells (the first one and the last one). 
Therefore C and C can be connected along T by a path x consisting of a-edges only. 

We denote by t 2 a shortest path among such x-s. Then we define t\ (£3) as the shortest 
subpath of the side of C (of C) connecting o and (t 2 )- ((^2)+ and d). 




Assume that there is an a-bands A starting with an a-edge of t 2 and ending with an 
a-edge e of dC. Then e belongs to some path ye where y consists of a-edges and connects 
C and C. Notice that every maximal a-band crossing the path y must cross t 2 because it 
cannot cross A, and C has no a-edges. Hence 1 2/ 1 « < I ^2 1 a. — 1, contrary the minimality in 
the choice of t 2 . 

Thus every maximal a-band A crossing t 2 must connect the top and the bottom of the 
trapezium A, and therefore the path t must cross every such an a-band A. Also t must 
cross every maximal a-band starting on t 3 whence \t\ a > \t 2 \ a + \ t 3 \ a = \t'\ a . Since the path 
\t\ must cross every maximal #-band of A we also have inequality \t\g > \t1\g-\-\t3\g = \t'\$. 
Now it follows from the definition of path length that \t'\ < \t\ as required. □ 

Lemma 4.2. Let A be an accepted i-sector with index % ^ 1 bounded by two maximal 
ki-band C and k i+ i-band C . Let o\ and o 2 be two vertexes lying on C and C, respectively. 
Assume that o\ and o 2 are connected by a path t, in A. Then the replicas o[ and d 2 of the 
vertexes 0\ and o 2 in the replica A' of A can be connected by a path t' such that \t'\ < \t\. 
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Proof. First of all, one may assume that no one maximal g-band C = Cx, C 2 , ■ ■ ■ , Ck+ 2 = C 
is crossed by the path t twice. Indeed, otherwise t has a subpath s of the form ezf, where 
e and / are g-edges of some Cj separated in this band by m (6, g)-cells for some m > 0. 
Therefore the path z must cross at least m maximal #-bands whence \ezf\ > m + 2. But 
the vertexes e_ and / + can be connected along Cj by a path of length m (see the example 
after the definition of length | * |), and so the path t can be shortened. 

Thus the path t is a product t = t\... t k+ i, where each tj connects a vertex o(j) lying 
on Cj with a vertex o(j + 1) lying on Cj + x, and for every j = 1, . . . , k, either starts 
or tj ends with a g-edge, and so \t\ = J2j=l \tj\- As in the previous paragraph, we have 
that each of tj-s crosses every #-band at most once. (Consider ezf, where e and / are 
#-edges of the same #-band.) Now using notation of Remark I3.1(J[ it suffices to consider 
the replica 1^- of the subsector Tj and find a path t!- connecting the replicas o'(j) and 
o 7 (j + 1), with < 

We may assume that % is odd. (If i is even one should use a mirror argument.) 

We first consider the path t% crossing the input subsector assuming that ti has no 
g-edges, since the g-edges (if any) can be attributed to the subpaths t\ and £3. By property 
(6) of Lemma I2.10[ C 2 has no a-edges. Hence, by Lemma 14.11 applied to a subtrapezium 
of Y 2 containing t 2 , we may assume that t 2 = S1S2S3, where s\ and S3 are the subpaths 
of top or bottom paths of g-bands C 2 and C3, respectively, and s 2 goes along a top or 
bottom of a maximal #-band T. For the both paths S\ and S3 we have paths s[ and s' 3 of 
the same length lying on the boundaries of the g-bands C' 2 and C' 3 of the replica T' 2 and 
connecting the replicas of the vertexes (si)± and (s3)±, respectively. The vertexes (s' 1 ) + 
and (s 3 )_ are either connected by a copy s' 2 of s 2 (if the 6*-band T was copied when we 
constructed the replica A') or (s' 1 ) + = (s' 3 )_ (if the corresponding 6*-band of A' has no 
a-edges). It follows that in any case we have \t' 2 \ < \t 2 \ for t' 2 = s'^s^. 




Assume now that j ^ 2. The subsector Tj has no if -cells by Lemma I3.8[ and so 
it is a union of alternating subtrapezia r^r^, • • • whose histories are words either in 
the alphabet 6 or in 9. Let tj = xx . . . xj, where every x k belongs to some Tj S . If the 
history of the trapezia Tj S is a word over O, then we have the copy T'j s of Tj S in A', and 
a subpath x = x k of tj lying in Tj S has a copy x' in Tj 8 . If the history is a word over 

9, then r^ a has no a-edges, and for every subpath x of tj lying in Tj S , we can construct 
a corresponding subpath x' in r^ s which copies only q- and ^-edges of x, but ignores the 
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a-edges of x. Since there are no a-edges in the common boundaries of neighbor r'- s and 
we nave ( x 'k)+ = ( x 'k+i)- ^ or ever Y k — 1, . . . ,d — 1, and we obtain \t' 2 \ < \t 2 \ for 
the path t 2 = x[ . . . x' d . 

Now the required path t' is obtained, and the lemma is proved. □ 

Assume that we have a minimal diagram T with a cyclically reduced boundary label 
over the group G(S U S,L), which is separated by a maximal fcj-band (or fcj + i-band) C 
in two parts A and A such that A is an accepted i-sector. Assume that every maximal 
6*-band 7i, . . . , T m of T crosses A, the bottom x of 71 is a part of the boundary dT, and 
4>(x) is a suffix of the subword ki-i . . .hi . . . fej+i(a prefix of the subword ki . . . fct+i . . . k i+2 , 
respectively) of the word So. Also we assume that for every j G [2,m], the trimmed 
bottom tbotTJ is a subpath of the top top7J_i. Below we call such a diagram unfinished 
if i, i — 1 1 (if i, i + 1 ^ 1, respectively). 

Lemma 4.3. T7ie part A of the unfinished diagram V can be embedded in an i — 1-sector 
(respectively, i + 1-sector) V which is a mirror copy of the i-sector A. 

Proof. Without loss of generality we assume that C is a /c^-band in the definition of unfin- 
ished diagram T. We construct V by induction on the length m of the history h = h\ . . . h m 
of the z-sector A. 
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Let II be the intersection cell for 7i and C. Then 7i consists of II and two subbands 
7i and 77', where 7i belongs to A and T" belongs to A. Let 11(1)' and 11(1)" be the 
neighbor cells for LT in T[ and in T", respectively. Then 11(1)' (if it exists) is a mirror 
copy of 11(1)" with boundary label in the alphabet Ai-i since these cells are determined 
by the same history hi and mirror g-letters of the word E as this follows from relations 
(I3.4|3.5p . Similar argument shows that if II'(l) has a neighbor IT(2) (where II'(2) ^ II) 
in 7i, then the cell IT"(1) has a neighbor IT"(2) (where n"(2) ^ H") in T", and n'(2) is a 
mirror copy of II" (2) with boundary label in alphabet Ai-i- By induction, we obtain that 
T[ is a mirror copy of a subband of T{' starting with LT. Hence one can extend T[ (and 
the subdiagram A) and obtain a #-band T~i which is a mirror copy of 7J". 

Since the trimmed bottom of the band 72 is a subpath of the top of 71, one can similarly 
subdivide T2 in 11(2), T 2 , T 2 , and prove that T 2 is the mirror copy of a subband of T 2 , 
starting with the cell 11(2) and having the boundary label over the alphabet Ai-i- (T'2 
and T" 2 m ay include (9, g)-cells and also (9, a)-cells.) Therefore there is an extension T 2 
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of T 2 , and this extension is a mirror copy of T 2 ■ Then by induction we construct 0-bands 
7^ v , . . . , T^ 7 , and these #-bands together with the g-band C form the required i — 1-sector 
V. □ 

Lemma 4.4. Let IS. be a minimal diagram with boundary path x\y\x 2 y 2 , where 

(1) 4>{yi) is a subword kj . . . ki of Ho, this subword does not contain the letter k\\ 

(2) x\ and x 2 are sides of the maximal kj-band Kj and ki-band /Q starting on y x ; 

(3) every cell of A belongs to one of the maximal 9-band T\, . . . , T m of A; 

(4) (a) either each of the bands 7%, . . . , T m crosses Kj or (b) each of them crosses /Q; 

(5) the trimmed bottom path of 7% is a subpath ofyf 1 , and the trimmed bottom path of 
Ti is a subpath of the top path of 77— l for every I = 2, ... , m; 

(6) one can construct a diagram A with boundary X\y\X 2 y 2 , and A satisfies the analogs 
of properties (l)-(5), but <f>(yi) = kj-\...ki in case 4( a ) (4>(y~i) = kj...k i+ i in case 
4(b)), and A is embeddable in A so that the kj-band Kj and the ki-band Ki remain maximal 
in A. 

Then in case (4) (a) (in case 4(b)), there exists a diagram A' over the group G\ 
with boundary path x'^x^y^ such that is the subword k 2 j-i . . . kj (respectively, 

ki . . . k 2 i-j) of S 0; x\ and x' 2 are sides of the maximal kj-band Kj and k 2 j-i-band K 2 j-i 
(of the maximal ki-band Ki and k 2i _j-band K 2i _j starting on y[), the label of x\ and x' 2 
are copies of 4>{x\) and<p(x 2 ), resp., and \y' 2 \ < \y 2 \. 

(The subscripts of k-bands are taken modulo L.) 

Proof. We consider the case (4)(b) only. The maximal fc-bands Kj, f£j+i ■ ■ ■ Ki + i subdivide 
diagram A in subdiagrams T/-s, where Ti is bounded by /Q and Ki + i and Ti includes these 
fc-bands (I = j, . . . 

Every maximal ^-band of A having a cell in IY_i, must cross both bands Ki and /Cj+i 
of A. Therefore the parts of these bands in Ti_i and Tj form an unfinished diagram whose 
i-sector is a subdiagram Aj of Tj. By Lemma |4TS| the subdiagram r^-i is embedded into 
the mirror copy Aj_i of Aj. Similarly, Tj_2 is embeddable into a mirror copy of Aj_i 
which is the copy of Aj (we denote this copy by Aj_ 2 ) >•••> is embeddable into the copy 
(or mirror copy) Aj of Aj. 




Let y 2 = ti_i . . . tj, where t\ passes through Y\ (and A;) for I = i — 1, . . . ,j. Since every 
subpath ti connects two vertexes on the fc-bands of the /-sector A;, we can construct 
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the mirror copy A / 2i _ l _ 1 (which is a 2i — I + 1-sector) of A/ or the replica of A; (if 
2i — I — 1 = 1), and the copies of the vertexes (£;)± are connected in A' 2i __ l _ 1 by a path 
t' 2i _i_i with |t 2 i-i-il < by Lemma 14.21 The desired diagram A' embeds in the union 
of these A 2i _,_ r s, and y' 2 = t\... 4-j-i- □ 

4.2 Shortcuts 

In this subsection we show that Lemma 14.41 helps cutting off a hub from a diagram using 
a 'shortcut'. But first consider few simpler statements. 

Lemma 4.5. Let a diagram A over G have a q-band C starting and ending on dA, and 
p is a side of C. Assume that no Q-band crosses C twice in A and there is a factorization 
xy of the boundary path of A such that x_ = p-,x + = p + . Then \p\ < \x\, \dC\ < \dA\, 
and Ixp" 1 ] < \dA\. 

Proof. On the one hand, the length \p\ of p is equal to the number m of (9-cells in C, since 
the every cell of C has one #-edge and at most one a-edge on p by condition 3 of Lemma 
12.101 and the definition of (9, g)-relations. On the other hand, every maximal #-band 
crossing C must terminate on x, since it does not cross C twice. It follows that \x\ > m, 
and so |p| < \x\. Similarly, we obtain inequalities \dC\ < |<9A|, and < \dA\. (We 

take into account the definition of length and the fact that the sides of the band C are 
separated by two g-edges lying on d A.) □ 

A maximal #-band of a diagram A called a rim band if its start and end #-edges as 
well as its top or its bottom lies on the boundary path of A. 



Lemma 4.6. Let A be a diagram over G with a rim band T having at most N (9, q) -cells. 
Denote by A' the subdiagram A\T. Then \dA\ - \dA'\ > 1/2. 

Proof. Let s be the top side of T and s C dA. Note that by our assumptions the difference 
between the number of a-edges in the bottom s' of T and the number of a-edges in s cannot 
be greater than 2N since every (9, g)-cell has at most two a-edges. However, A' is obtained 
by cutting off T along s', and its boundary contains two #-edges fewer than A. Thus one 
can compare the boundaries of A and A' as follows. There is a one-to-one correspondence 
between the g-edges of these boundaries, and to extend this correspondence to the 9- and 
a-edges of intermediate subpaths of the boundaries, one should remove 2 6*-edges from 
dA and add at most 2N a-edges. Therefore it follows from the definition of length and 
inequality (jiTTjl . that |<9A| - \dA'\ > 2(1 - 8) - 2N5 > 1/2 . □ 

Lemma 4.7. Let a diagram A have two cells: an H-cell II and a (6, a) -cell n which 
have a common a-edge e, ep is the boundary of II and efe'f is the boundary of tc, where 
0(e) = 0(e') _1 = a and 4>{f) = </>(f) = 9 for a 9 -letter 9. Then there is a diagram A' 
with the same boundary label as A composed of 'IT and a 9-band T, and p is the side ofT. 

Proof. The letter a commute with 9, and therefore every letter of the boundary label of 
IT commutes with 9. Hence one can construct a #-band T with boundary g'p~ l gp' ', where 
4>(p') = <f){p), and 4>{g) = (t>{g') 1 = 9~ 1 . If we attach the band T to n along the path p 
and remove tt, we obtain the required diagram A'. □ 
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Let II be a hub of a minimal diagram A given by Lemma 13.111 Using the notation of 
that lemma, we recall that the subdiagrams r, and r i+1 intersect along the fc-band Bi+\ 
(i — 1, . . . , L — 5). We denote by \I/ the minimal subdiagram containing all the r r s for 
i — 1, . . . , L — 4. The boundary path of \1/ is x'x", where x' is composed from the sides of 
Bi, Bls, and a subpath of dU, while x" is a subpath of <9A. 

Lemma 4.8. One can construct a minimal diagram ^' over G\ with boundary path x'x 
such that (1) ^/' includes the bands B\ and Bls (2) \x\ < \x"\, (3) the subdiagram ty' has 
no maximal q-bands except for the q-bands B'i (i = 1, . . . , L — 3) starting on x' , (4) every 
maximal 8-band of ^' crosses either the band B\ = B\ or the band B' l-3 = Bl-3, (5) the 
subdiagram ty' has no H- cells between the pair of k-bands B'i and B'i+i, unless this pair 
is a pair of k\- and k 2 -bands. 

Proof. Let be a minimal diagram with boundary of the form x'x which satisfies condi- 
tions (1) and (2) and has minimal \x\. Since ^ satisfies conditions (1) and (2) with x = x' , 
such ty' exists. Clearly the path x has no loops. If the diagram has a maximal g-band 
C which does not start/terminate on II, then one can cut off C and shorten x by Lemma 
14.51 So satisfies condition (3). 

Assume that the diagram ^' does not satisfy condition (4) of the lemma. Then by 
Lemma EH we have a #-band of starting and terminating on x. It follows that there is 
a #-band T starting and terminating on x, such that the subdiagram $ bounded by T and 
a part y of x has no non-trivial #-bands, i.e., it contains only iJ-cells. Two if-cells of $ 
cannot have a common edge since otherwise they can be replaced by one if-cell contrary 
to the minimality of It follows that every if-cell ir of $ has a common edge with T 
because the path x has no loops. 

Thus we have a series of if-cells 7Tl, . . . , tt s in $ with boundaries yiZi (i = 1, . . . , s), 
where yi is a part of y (or yi is empty) and Z{ belongs to the side z of T . If \<!>( z i)\a < 2, 
then ^2 \ z i\ < 25 since every Zi is a product of a-edges. Therefore \z\ < \x\ +25. It follow 
from Lemma \4. 61 that if we remove all 7r r s and then cut off the band T, then we decrease 
the length of x since 25 < 1/2; a contradiction. 

Hence ^ |0(^j)| a > 3, and so at least one of 7Tj-s has a common edge with a (9, a)-cell 
of T . (We recall that T intersects each of the maximal g-bands of ^' starting on x" at 
most once by Lemma I3.4[ and so at most two of the (9, g)-cells of T have a-edges with 
a G Ai, and each of these two (9, g)-cells can have at most one a-edge.) Therefore we can 
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apply Lemma 14.71 to replace the (6, a)-cell by a #-band passing round the cell 71$. This 
modification of the band T decreases the number of if -cells in $, since one of the if-cells 
gets over the band T ■ 

The modified diagram can be non-minimal, but our surgery preserves g-bands and 
keeps the property that every g-band and every #-band have at most one common (6, q)- 
cells. So soon or later, this trick makes the inequality ^2 \(f>(Zi)\ a < 2 true, and one can 
decrease x, as was explained above. If one replaces the obtained diagram by a minimal one, 
then the condition (1) still holds since the maximal g-bands B\ and <Bl_3 are completely 
determined by the boundary as this follows from Lemma 13.41 This contradict to the 
assumption on the minimality of \x\. Thus the condition (4) holds. 

If has if-cells in the subdiagram between the pair of A;-bands B'i and B'i+i, which 
are not a pair of k\- and fc2-bands, then these if -cells (with labels over the alphabet A±) 
cannot have common edges with the maximal #-bands of 1^ since the #-bands of must 
intersect either B'i or B'i + \ by (4). This implies that has no if-cells at all because the 
path x has no loops. The lemma is proved. □ 

Lemma 4.9. If a minimal diagram A has a hub, then the (cyclic shift of the) boundary 
path of A can be factorized as pp' so that the subpath p starts and ends with q-edges (and 
there is a simple path z in A with Z- = p_ , z + = p + such that the subdiagram bounded by 
the loop pz^ 1 has exactly one hub II and the label <p(z) is equal in the group G\ to a word 
of length < \p\. 

Proof. We may assume that a hub II is chosen in A according to Lemma 13.111 Let 
x'x" be the boundary path of \l/ as in Lemma 14.81 We will look for the path z in the 
minimal subdiagram A' obtained after removing of ^ and II from A. Therefore to prove 
the lemma, one may assume using the notation of Lemma [4.81 that = ^ and x = x", 
i.e., the subdiagram \1/ itself has properties (3), (4), and (5) from Lemma 14.81 (We do 
not know if the diagram U II U A' is still minimal but we will use the minimality of \J/' 
only.) 

There is d > such that there exist exactly d maximal #-bands of ^ crossing each of 
the /c-bands Bi, . . . , B^s- This implies that the initial subbands Bi[d] of length d in all 
Bi-s (i = 1, . . . , L — 3) are copies of each other under the shifts of the indexes in their 
boundary labels. 

Let T be the set of remaining maximal #-bands of i.e., every band of T intersects 
exactly one of the bands B%, Bls- It follows from Lemma I3~4l for \l/ that there is an integer 
I (1 < I < L — 3) such that no #-band of T crossing B\ crosses Bi + \ and no #-band of T 
crossing Bls crosses £>/. We have either (L— 3)— I < (L— 3)/2or (l+l) — 1 < (L— 3)/2 since 
L is even. Without loss of generality we choose the former inequality, and so / > (L — 2)/2. 

If Bi is fci-band for some i < 6, then we will consider a smaller subdiagram bounded 
by Bi and £>l_3 instead of (Respectively, we change the complimentary minimal 
subdiagram A'.) If none Bi, . . . ,B$ is a A^-band, then we does not change Thus, in 
any case we can reindex the /c-bands and assume that \1/ is bounded by B\ and B^ r 
for some r < 9 and that the bands B 1; . . . , B r+3 are not fci-bands. Let they be kr,..., 
£^±(r+2)-bands for some i. We will assume that £> r +3 is a fcj_ r _2-band. 

Since L > 40, we have after such a reindexing that / > (L — 2)/2 — 6 > 12 > r + 3, and 
so no (9-band from the set T crossing the band Bls crosses B r+3 . We denote by $ the 
part of the diagram \l/ bounded by £> r +3 and B\. Let 7^*, . . . , be the maximal #-bands 
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of Then by the choice of the sub diagrams \I> and every cell of $ belongs to one of 
these #-bands, and each of these bands crosses the fcj-band B\. We will assume that 7~® 
is the closest band to the hub II, and so on. 

For every j > 1, we have that at least one of the two g-edges of every (9, g)-cell of 
belongs to the top of 7~*i (to <9n if i = 1) because \1/ has no maximal g-bands except 
for the bands starting on II. Assume that a band 7~*, starting with a (9, g)-cell of Bi, 
terminates with an (9, a)-cell 7r having no a-edges on dT^x- Then an a-edge and one 
#-edge of II lie on the boundary subpath x" of and so if one removes tt from the 
length of x" does not increase since [vr j has 2 #-edges and 2 a-edges, and all properties 
(3)- (5) hold for the remaining part of Therefore we may assume that every edge / of 
tbot(7~*) belongs to top(7;* 1 ) (to dU for i = 1). 




To construct the path z, we go along the side of the band B 2 which is closer to £>i, then 
go along the part of the boundary of the hub which is not part of <9\I/, and finally go to 
dA along the side of £>L_ r which is closer to £>£_ r+ i. Thus we have z = Z\Z 2 z 3 according 
to this definition, and respectively, (f>(z) = Z = Z\Z 2 Z 3 . Let p be the subpath of dA and 
d^fr such that p- = z^,p + = z + . Then the boundary path of A is of the form pp', for an 
appropriate p'. 

We denote by T the part of the diagram $ bounded by B r+3 and £> 2 . Let y be the 
common subpath of dT and p with y_ = p_ = z_. We may apply Lemma 14.41 to the pair 
(r, $) and obtain a new diagram V over G\ according to that lemma. One of the four 
boundary sections of V is a subword ki_i...k i+r+ i of the word E . 

Note that Bi- r is a /cj +r+ i-band since we take the indexes of the /c-letters modulo L. 
By Lemma [4.41 V has a loop with label of the form Z\Z2Z'Y' ', where Z' is the copy of 
the word written along the band B r+ 3, and \Y'\ < \y\. Hence Z = (y / ) _1 (Z / ) _1 Z% in G±. 
Therefore to prove that Z is equal in Gi to a word of length < \p\, it suffices to prove 
that the word (Z')~ 1 Z 3 is equal in Gi to a word of length < |se|, where p = yx. 

We observe that the words Z' and Z 3 have equal prefixes of length d since Bi- r [d] is 
a copy of # r+3 [d]. Therefore the word (Z')^ 1 Z 3 is equal to {Z')~ l Z 3) where the [Z ,s )~ x 
copies the label of the part of the side of B r+3 crossed by the 6*-bands from the set T 
only, and Z 3 is the label of the part of z 3 crosses by the #-bands of T only. We denote 
the union of these two subsets of T by T. The length of (Z')~ 1 Z 3 does not exceed the 
number |T| of bands in T since neither of the bands from T crosses both B r+3 and Bl- t - 
By Lemma [3.41 every band of T must end on the subpath x. Hence \(Z')~ 1 Z 3 \ < \x\. In 
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fact this inequality is strict since L — r > r + 3 (as r < 9) and so x must includes some 
g-edges as well. The lemma is proved. □ 



5 Spaces of words 

5.1 Spaces of boundary labels of some diagrams. 

We call a disc simple if it has no if-cells and either all its #-edges have labels from or 
all of them have labels from 0. 

Lemma 5.1. Let A be a diagram having exactly one hub IT. Then there is a diagram A 
with the same boundary label as A such that A has a simple disc subdiagram D, and the 
annular diagram V = A\D is a minimal annular diagram without 8-annuli. 

Moreover, one may assume that the boundary label of D is of the form 
{kiWik,2W2 ■ ■ ■ kiW^ 1 , where fciW^A^W^ • • • ^iW l is accepted by either machine S(L) or 
by S(L), and the lengths of 6-annuli in D do not exceed N + L(space Su g(W2)) ■ 

Proof. We may assume that A is a minimal diagram. Let D\ be a maximal disc subdi- 
agram of A, and denote by /Ci, . . . , ICl the maximal k\, . . . , fc^-bands of D\ starting on 
the hub n. We denote by Tj the maximal accepted z-sector of D\ bounded by ki and fcj+i 
(i = 1, . . . , L). By lemmas [3.81 and [3TW 1 ) . these sectors have no if -cells for i ^ 1 and each 
of them is a copy or a mirror copy of the 2-sector r 2 . 

Now we replace the 1-sector Ti by the replica r' 2 of T 2 in D\. (For these aid, one can 
made a cut along /Ci, /C 2 and the part of the boundary of II between these /c-bands, and 
insert two mirror copies of r' 2 along this cut.) By the definition of replica, we obtain a mod- 
ification D2 of the disc diagram D\ with boundary label of the form fciW / i/c 2 iy 2 . . . kjJW^, 
where W\ is a mirror copy of Wi or W\ has no a-letters. Since A^W 7 ^ is an accepted 
2-sector word, the word fciW / i/c 2 W / 2 . . . kj^Wi is accepted by either machine S(L) or ma- 
chine S(L) by Lemma [3. II Moreover the length of this computation does not exceed the 
length of the computation of S U S by the same lemma. Therefore by Lemma [3.9( 2). the 
disc D2 can be replaced by a disc D3 which has no ii-cells, whose labels of #-edges either 
all belong to or all belong to 0, and whose number of ^-annuli does not exceed that 
number for D 2 . 

Now, if necessary, the annular diagram A\D% can be replaced by a minimal diagram 
T over the group G\. Assume that V has a #-annulus T. By Lemma 13.44 T surrounds 
the disc diagram D 3 , and so D 3 can be included in a larger disc subdiagram D4 which 
contain more (k{, #)-cells for i 7^ 1, 2 since the extensions of the /Q-s have to cross T. 
Then one can make the surgery as above and replace D3 by a larger simple disc D4 This 
procedure terminates, because we do not change the number of (ki, #)-cells {i 7^ 1,2) in 
the compliment of disc when passing from D\ to -D 2 and from Z} 2 to D% and reduce this 
number when passing from D 3 to D 4 . (Recall that the rank of such cells is higher than the 
ranks of other cells in diagrams over G\.) Thus the procedure terminate with a desired 
diagram V. 

Finally, one can replace the disc by a disc corresponding to a computation of minimal 
space and use Lemma 13.11 to make the second claim of the lemma true. 

□ 
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Lemma 5.2. There are positive constants C\ and C2 with the following property. For the 
boundary label w = k\Wi . . . k 2 Wi of the simple disc D from lemma \57T\ there is a sequence 
of elementary transformations (say, simple sequence) w = Wq — >■ wi — >■ • • • — >■ w t = 1 using 
the relations of G(S U S, L) and the hub relation V3. 6}) , such that \wi\ < CiS' s (\W 2 \) + c 2 
(i = 0,l,...,t). 

Proof. Let us start with the words wo = u>(0), w(l), . . . , w(m) = Eq, written on the 
boundaries of the #-annuli of D. By Lemma 15.11 and 13. 1[ there is c\ > such that 
\ w (i)\ < ciiS^dW^I) + N. Notice that w(i) is written on the top of a 9 annulus T of D 
and w(i + 1) is written on its bottom. The band T has N (9, g)-cells. For these cells 
the combinatorial lengths of tops and bottoms differ by at most ±1. The remaining 
cells are (9, a)-cells, and there tops and bottoms have one a-edge. Therefore one can 
insert several elementary transformations between w(i) and w(i + 1) corresponding to a 
sequential removal of the cells of T so that the combinatorial length of the words obtained 
after the refinement of our sequence does not exceed \wi \ + iV + 2. To complete the proof, 
it suffices to set c 2 = 2N + 2. □ 

Lemma 5.3. Assume that (1) a minimal diagram A has no hubs and no q-bands or (2) 
A is a union of a simple disc D and a minimal annular diagram V over G\ surrounding 
the disc subdiagram D and having no 8-annuli, and every maximal q-band of A starts on 
the hub of D. Then the sum perimeters ou of all H-cells in A does not exceed c%\dA\ 
for some constant C3 independent of A. 

Proof. We consider the condition (2) of the lemma only since a simplified argument works 
if A satisfies condition (1). 

Two different if-cells cannot be connected by an a-band in a minimal diagram since 
otherwise this subdiagram can be replaced by a diagram with one if-cell and several 
(9, a)-cells contrary to the minimality of the diagram. (See Lemma 3.12 (2) in [21].) A 
maximal a-band A cannot connect a-edges of the same if -cell 7r in a disc subdiagram 
by Lemma 3.12 (3) in |21j, and also A and 7r cannot surround the disc D since their 
boundaries have no g-edges. Therefore every maximal a-band starting on an if -cell ends 
either on dA or on dD, or on a (9, g)-cell of the annular diagram T. Therefore to estimate 
o~Hi we should give an estimate for the number n$ A of (9, g)-cells in T and for the number 
of a-letters in the word Wi, where k\W\ . . .k L W L is the boundary label of D. (Recall 
that the edges of if-cells are labeled by a-letters from the alphabet A\ and so cannot be 
connected by a-bands with subpaths of dD labeled by W 2 , ■ ■ ■ , Wl.) 

Let T be a maximal #-band of V . Since both the start and the end 9 -edges of T must 
belong to dA, it follows from Lemma [3.41 that T crosses every g-band of F at most once, 
and so has at most iV (9, g)-cells, because every maximal g-band of T starts on the disc 
D. The number of maximal #-bands of T does not exceed dA since T has no ^-annuli. It 
follows that n 9tq < N\dA\. 

Since we have \Wi\ a < \W2\a — \Ws\ a = . . . for the simple disk D, we will look for 
an upper estimate for | PV2 1 a, - Every maximal a-band A starting on the subpath p of dD 
labeled by W 2 cannot ends on p by Lemma 13.41 since the word W 2 is reduced and has no 
^-letters. Therefore A terminates either on one of two closest maximal g-bands Q and Q' 
starting on D or on dA. The lengths of Q and Q' are at most \dA\ as this was explained 
in the previous paragraph, and so each of the sides of these g-bands has at most \dA\ 
a-edges. Therefore \W 2 \ a < 3|<9A|. 
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Assume that a word w vanishes in the group G given by relations (13. 4p . (I3.5p . and (13. 6p . 
Then we denote by Spacedw) the minimal number m such that there is an elementary 
reduction of w to the empty word such that at every step i, we have a tuple of words 
(wn, . . . , Wi ;S (i)) with \wn \ + ■ • • + |ifj )S (j)| < m. For a set W of words vanishing in G (call 
such a set vanishing), we define the space function fc,w( x ) as the maximum of Spacec{w) 
over the words w G W with < x. (Thus we use the length | | here unlike the length 
| || used in Introduction.) 

Now we denote by Wi the set of words read on the boundaries of simple discs, and we 
say that w G W2 if w can be read on the boundary of an fZ-cell. 

Lemma 5.4. The function fa,w 2 i> s bounded from above by a function equivalent to /g,vw 

Proof. Assume that \w\ = n > 1 and w G W2. Since the set of generators ai, a m of H is 
symmetric, for every i < m, we have a positive relation of the form ajCij/ for some i! < m. 
By Lemma 13.21 these 2-letter relations are consequences of the relations of G. Since the 
set of 2-letter relations is finite, there is a constant C4 such that one can letter-for-letter 
convert w in a positive word u of the same length, and the space of the corresponding 
chain of transformations is < n + C4. The word u is a product of cyclic shifts of the words 
£(«) and S(-u) as this was explained at the end of Subsection 13.11 Here both words £(ii) 
and belong to Wi and their lengths are at most Ln + N < (L + N)n. Therefore for 
any w G W2, we have spacec(w) < 2fc,w 1 {{L + N)n) + C4 which implies the statement 
of the lemma. □ 

Lemma 5.5. Let C > 0. Assume that for every w from a vanishing set G W, there is a 
sequence w = wo — )■ Wi —)■•••—)■ w t = 1 sitc/i that for every i — 0, . . . , t — 1, a q/c/ic s/w/it 
0/ i/ie word is freely equal to a product of a cyclic shift of Wi + \ and a word Vi from a 
vanishing set W , where max(|t>j|, \wi\) < C\w\ (i — 0, . . . , t — 1). Then the function fc,w 
is bounded from above by a function equivalent to few- 

Proof. The condition of the lemma implies that we can apply the following series of 
elementary transformations to Wi. First series of transformations replaces the word by 
its cyclic shift, the second series deletes/inserts mutual inverse letters operating with the 
words of length < 2C|w|, then we split the obtained word in a product of a cyclic shift of 
w i+ i and the word v i: then we keep w i+ i unchanged and use the appropriate procedure 
reducing the word V{ to the empty word, and finally obtain the word word w i+ i using cyclic 
shifts. Clearly, we have that the space of this procedure is at most 2C|w| + fG,w{C\w\). 
Thus, by induction on i, we have Spacec{w) < 2Cn + fc t w(Cn) for arbitrary word 
w G W of length at most n. The lemma is proved. □ 

Now we introduce the set W3 of the boundary labels of diagrams A satisfying the 
condition of Lemma i.e., either (1) A is a minimal diagram A having no hubs and no 
g-bands or (2) A is a union of a simple disc D and a minimal annular diagram T over G\ 
surrounding the disc subdiagram D and having no #-annuli, and every maximal g-band 
of A starts on the hub of D. 

Lemma 5.6. The function fa,w 3 is bounded from above by a function equivalent to 

/g,WiUW 2 - 



40 



Proof. We will assume that a word w of length n > 5 is the boundary label of a dia- 
gram A satisfying condition (2) in the definition of the set W3. Every cell of the annular 
subdiagram T is either if-cell or a #-cell. Therefore there is a sequence of diagrams 
A = A , Ai, . . . , A t = D such that for i = 1, . . . , t, the diagram Aj results from Aj_i 
after one cut off either (a) an if -cell or (b) a rim #-band, or an edge e such that ee _1 
belongs to the boundary path of A,_i. The surgery of types (b) and (c) decreases the 
perimeter by Lemma 14.61 Although the surgery of type (a) can increase the perimeter, it 
follows from Lemma [5.31 that the perimeter of every diagram Aj is at most (1 + c^)n. 

Now we have a sequence w = w , wx, . . . , w t , 1 where w , . . . , w t are the boundary labels 
of A , A 1; . . . , A t = D, of lengths at most (1 + c 4 )n, such that, for every i = 0, . . . , t, a 
cyclic shift of the word Wi is a product of a cyclic shift of the word Wj+i and a word 
Vi, where v% is either a boundary label of an if-cell of A or a 2- letter word aa _1 , or 
the boundary label of the simple disc D, or the boundary label of the the rim #-band 
of A,. In all these cases \vi\ < 2(1 + c^jn, in the case of rim band, we obviously have 
Spacec\vi\ < 2(1 + €4)12, and in other cases Spacec\vi\ < /g,Wiuw 2 ((1 + 04)1%). Therefore 
one can apply Lemma 15.51 and complete the proof. □ 

By definition, the set of words W4 contains the set W3 and consists of boundary labels 
of diagrams A, where either (1) A is a minimal diagram having no hubs or (2) A is a 
union of a simple disc D and a minimal annular diagram T over G\ surrounding the disc 
subdiagram D and having no #-annuli. 

Lemma 5.7. The function fc,W4, is bounded from above by a function equivalent to /g,w 3 - 

Proof. Again we assume that a word w of length n > 5 is the boundary label of a diagram 
A satisfying condition (2) in the definition of the set W4. Assume that A has a maximal 
g-band C which does not start or terminate on the simple disc D. Then A is separated in 
3 subdiagrams: Fi contains the disc D, T 2 = C, and T 3 is the remaining part of A. On 
the one hand, the lengths of the top and bottom of C is equal to the number of cells m in 
C since every cell of C has one #-edge and at most one a-edge on each of the sides of C. 
One the other hand, each maximal #-band of A crossing C must start and terminate on 
<9A. This implies that the perimeters of each subdiagram Ti,r2, and T3 are at most 2n. 
(Here we use the definition of length and take into account that the band C starts and 
ends on g-edges.) 

Therefore there is a sequence of diagrams A = A , A 1; . . . , A t of perimeters < In such 
that for i — 1, . . . ,t — 1, the diagram Aj results from Aj_i after one cut off either (a) an 
subdiagram without g-bands or (b) a g-band, and A f has no maximal g-bands except for 
those starting/terminating on the hub of D. Let w = wq, wi, . . . , w t be the boundary labels 
of these diagrams. Every word tUj (i — 0, . . . , t) of the series w = wo,wi, . . . ,w t , w t +i = 1 
(or its cyclic shift) is a product of a cyclic shift of the word tfi+i and a word V{, where Vj 
either belongs to W3 or it is the boundary label of a g-band. In all these cases \vi\ < 2n, 
in the later case we obviously have Spacec\vi\ < 2n, and in former cases spacec\vi\ < 
fc t y\; 3 (2n). To complete the proof, we apply Lemma [5.51 □ 

Lemma 5.8. Let W5 be the set of all words vanishing in G. The space function /G,Ws( n ) 
of the group G is bounded from above by a function equivalent to /g,w 3 uw 4 ( n ) ■ 

Proof. Let w = 1 in G and |«;| = n > 0. If w — 1 in Gi, then Spacea(w) < Spacea,w 3 { w ) 
by Lemma [5\71 Otherwise the minimal diagram A with boundary label w has t > 1 hubs, 
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and by Lemma 14.91 a cyclic shift of the word w = Wq is a product of a word Vi written on 
the boundary of a diagram V having one hub, and a word W\ which is a boundary label 
of a diagram with t — 1 hubs, and |u>i| < n, \vi\ < 2n. By Lemma 15.1 1 V\ G W4. Now 
induction on t gives a series of words wo,w\, . . . ,w t = 1 and words v 1, . . . , v t satisfying 
the conditions of Lemma [531 with C = 2, and our statement follows from that lemma. □ 

5.2 Proofs of main statements. 

Proof of Theorem 11.21 Let a DTM with space function f(n) recognize the language of 
vanishing in H words in the finite set of generators of the group H. By Lemma 2.10 (a), 
there is an ^-machine S recognizing the same language, and the generalized space function 
S' s (n) of S is equivalent to f{n). The group G constructed on the basis of S in Subsection 
13.11 is finitely presented and contains H as a subgroup by Corollary 13.11 The consecutive 
application of lemmas 15.81 15.71 15. 6[ 15. 4[ and 15.21 results inequality /g,w 5 (™) ^ S' s (n). 
Note that the space functions /g,w 5 (^) an d sc(n) of G are equivalent since the lengths 
functions || * || and | * | satisfy inequalities S\\w\\ < \w\ < \\w\ \ for every word w. Hence 
SG{ n ) ^ S' s (n). Since S' s (n) ~ f{n) by Lemma [2. 101 (2). we have Sciji) -< f(n). 

To invert this inequality, we first note that sdn) >z log<i(n), where d(n) is the Dehn 
function for G. (Indeed, up to equivalence, the length t of a rewriting W = (w Q ) 
W t = ( ) without repetitions does not exceed exp(max* =0 ||Wj||); see also Theorem C in 
[7]). Therefore it suffices to show that d{n) y exp(/(n)). By lemma [2T4T 3). S' M ,(n) ~ 
f(n), and by Lemma 12.91 (4), T' s {n) y exp(S' M ,(n)). Thus it remains to explain that 
d{n)yT' s {n). 

Let W be a word accepted by S, such that 1 < \W\ a < n and times (W) = T s (n). 
Then the word V = k^W\ . . . k N _iW^k N ( where Wi are copies or mirror copies of W) is 
accepted by S(L) and therefore it is conjugate of the word S (see subsection 13. ip . Hence 
V — 1 in G. Let A be a minimal diagram over G with boundary label V. Since dA has 
only one fci-edge, the maximal fcx-band starting on this fci-edge must end on the boundary 
of a hub. Hence A has a hub n satisfying to the condition of Lemma [3.111 If one removes 
n together with the bands £>i, ... , Bls and with subdiagrams r\ (i = 1, . . . , L — 4), then 
the remaining diagram A' has at most L — (L — 3) + 3 = 6 fc-edges. It follows from 
Lemma [3. Ill that A' has no hubs since 6 < L — 3. Thus A has exactly one hub. 

Obviously, every /cj-band starting on <9n ends on <9A {i = 1, . . . and so we can 
consider the 2-sector T of A bounded by k2- and k^-bands. By Lemma ISTW l). the number 
of maximal #-bands of V is equal to the length of a computation of S(L) US(L) accepting 
the word W. By Lemma \3.1\ we can replace S(L) US(L) by S in the previous phrase, and 
so area(A) > T' s {n). Since ||V|| < ClH^la for a constant C, we have d{Cn) > T s (n), and 
the required lower bound is obtained. □ 

Proof of Theorem 11.41 Let M be a DTM with space function f(n). Then as in the 
proof of Theorem ll.2[ we can construct an S"-machine S with S' s (n) ~ f(n). But now we 
simplify the construction of the group G: It is defined by the relations associated with 
the machine S(L) only and the hub relation (there is no S(L) now, and so we have no 
if- relations at all). The (simplified) proof of Theorem 11.21 works in this simplified setting 
and therefore sc(n) ~ f(n). □ 

Proof of Corollary 11.51 Let an NTM has an FSC space function f(n) and solves 
the word problem in a finitely generated group H. Then by Savitch's Theorem (see [9], 
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Theorem 1.30), there is a DTM which solves the same problem with space ~ f{n) 2 . It 
remains to refer to Theorem 11.21 □ 

We need one more lemma to prove Corollary ll.61 This is a version of Savitch's theorem 
(see Theorem 1.30 in [9]), but instead of simulation of the work of a NTM by a DTM, 
now we need a DTM computing the space function of given NTM. 

Lemma 5.9. Let M be an NTM with space function S(n) bounded from above by an 
FSC function f(n). Then there is a DTM Mo such that (1) Mq computes S(n), i.e., 
for any input n G N given in binary, it computes S(n); (2) the space function SM (n) is 
0(f(nf). 

Proof. Without loss of generality, we may assume that M satisfies the s*io condition. 

If u is an accepted input word for M and ||w|| < n, then the time of any computation 
(without repetitions) of space < f(n) accepting u is at most < 2 c/ ("), for some integer 
c > since all configurations in this computation are of length < f(n) + cq for a constant 
Co- So the goal for a DTM M' we want to define, is to find a computation C of M of 
minimal space and of length at most 2 C ^™) which connects the input configuration and 
the accept configuration of M, and then to compute the space of this computation. (If 
such a computation exists; otherwise M' says that u Cm-) Indeed, the required machine 
M Q will examine all words u with < n in lexicographical order, will switch on M' for 
every such input word u, and will compare the spaces space a/ (w) of u-s on an additional 
tape keeping only the maximal one after every return. 

For arbitrary words w and w' and k G N, define predicates reach n (w,w' , k) to mean 
that w and w' are configurations of M and there is a computation w — > . . . w' with 
time < k and with space < f(n) + cq. By this definition, M accepts an input word u of 
combinatorial length < n iff reach n (wQ,Wf,2 c ^ n ^), where Wq = w(u) is the unique input 
configuration on input u and Wf is the unique accept configuration of M. 

Note that reach n (w,w',k+j) iff (3w")(reach n (w,w" , k) and reach n (w" , w' , j)) and 
the minimal space of computations w —>■■••—>■ w' is the minimum over all w" of the 
maximums of minimal spaces for w —>••••—>• w" and for w" — > ■ ■ ■ — > w'. This observation 
leads to the following (slight) modification of Savitch's machine. 

• Given n G N, compute 2 c ^ n ^ (in binary) using that f(n) is an FSC function. 

• Then, reach n (w, w', 1) is true if w = w' or w transforms into w' under the application 
of a single command of M. The space of the computation w — > w' is max(|u>| a , |w/| a ). 

• If k > 2, then for all possible configurations w" of M with length < f(n) + cq, 
compute whether it is true that reach n (w,w" , [{k + l)/2]) and reach n (w" ,w' , [(k + 
l)/2)]. Set reach n (w, w', k) to be true iff such w" exists. Find the minimal space of 
computations w —>••••—>■ w' of length < k using the information on the minimal 
space computations w —>■•••—>• w" and w" —»■•••—»■ w' of length < [{k + l)/2]. 

It is easy to see that the constructed machine M' computes, in particular, the space 
of any accepted input word u of length < n. Passing from k to [(k + l)/2], we need an 
additional space to store the information on k, w, w', on the current w", and afterwards, on 
the minimal space of computations w —>••••—>• w' of length < k. Clearly, this additional 
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space is 0(f(n)), and since starting with k = 2 c ^ n \ we divide k by 2 cf(n) times, the 
total space used by M' and by M is 0(f(n) 2 ). 

□ 

Proof of Corollary 11.61 Assume that a is computable with space < 2 2 " 1 . It follows 
that for m = [log 2 log 2 n] we can recursively compute binary rationals a m such that 

\a - a m \ = 0(2- m ) = 0((log 2 n)' 1 ) (5.8) 

and the space of the computation of a m is at most n. In addition, one may assume that 
the number of digits in the binary expansion of a m is 0(m). Therefore the computation 
of [log 2 n] (in binary) and of the product a m [log 2 n] needs space at most 0((log 2 n) 2 ). 
Then we rewrite the binary presentation of [a m [log 2 n]] in unary (as a sequence of 1-s). 
This well-known rewriting (e.g., see p. 352 in [22]) has the space function of the form 
[a m [log 2 n]} + 0(1). One more rewriting of this type applied to the unary presentation of 
[a TO [log 2 n}}, will have the space function of the form 2 <l ° lm ^ og2Tl ^ + 0(1). Using (15.81) . we 
can present this function as 

2a[iog 2 ri]+o(i) _j_ ^ 2 a [ lo s2«] ^ 

Thus the subsequent application of the above mentioned DTM-s has space function 
equivalent to n a , and we can apply Corollary II. 41 to obtain a finitely presented group with 
space function equivalent to n a . 

Now assume that a function [n a ] is equivalent to a space function of a finitely presented 
group G. Then by Proposition 11.11 there is an NTM M whose space function S*m(^) is 
equivalent to [n a ], that is 

cm a < S M (n) < c 2 n a (5.9) 

for some positive ci, positive integer c 2 , and every sufficiently large n. In particular, we 
have 5*A/(n) < c 2 n d for some integer d and every n. Since c 2 n d is an FSC function, we 
may apply Lemma [5791 and obtain a DTM Mq computing the function Sm^) with space 
0(n 2d ). Hence Mq computes the function 5*M(2 2m ) of m with space 0((2 2m ) 2d ). This 
space is less than 2 2 " l+Ci ' for some c 3 . Hence for some c 4 G N, M computes the function 
SmC^ 2 " 1 C4 ) with space at most 2 2 '" . 

Let us plug n = 2 2 ™ C4 to inequalities (I5.9p and then take log 2 of the terms. We obtain 

Ai + «2 m - C4 < log 2 S M {2 2m ~ C4 ) < A 2 + «2 m " C4 
where A, = log 2 q (i = 1, 2). It follows that 

| a - 2- m+C4 log 2 5 A f(2 2m_C4 )| < c2~ m+C4 = 0(2" m ) (5.10) 

where c = max(|Ai|, |A 2 |). Recall that 5*A/(2 2m C4 ) < 2 2 ™ 1 and so this number has at most 
2 m ~ 1 + l binary digits. Therefore the real numbers log 2 (S , M (2 2m C4 )2~ m+C4 are computable 
with error 0(2~ m ) and space 0(2 2m ). Now it follows from (15.101) that the real number 
a is computable with space 2 2 "\ □ 
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